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PREFACE 


This  book  contains  the  proceedings  of  a Symposium  on  Approximation  Theory  which  was 
held  in  Austin  on  January  18-21,  1976.  The  meeting  was  attended  by  more  than  120  mathemati- 
cians. In  addition  to  the  United  States,  the  other  countries  represented  were  Belgium.  Brazil,  Cana- 
da, France,  German  Federal  Republic,  Hungary,  Israel,  Italy,  Netherlands,  New  Zealand,  Sweden, 
and  the  USSR.  The  organizing  committee  for  the  symposium  consisted  of  E.W.  Cheney,  C.K.  Chui, 
G.G.  Lorentz  (director),  and  L.L.  Schumaker. 

The  program  of  the  symposium  included  seven  one-hour  invited  expository  lectures,  more 
than  forty  one-half-hour  invited  research  talks,  and  a large  number  of  contributed  fifteen  minute 
talks.  Because  of  space  Limitations,  we  have  been  able  to  publish  only  the  invited  papers. 

The  expository  papers  describe  recent  developments  in  some  important  special  fields.  For 
example,  D.  Braess  discusses  nonlinear  approximation,  C.  de  Boor  surveys  the  central  role  of 
/(-splines  in  the  general  theory  of  splines.  “This  exhibits  the  essentially  local,  but  not  completely 
local  character  of  splines.”  R.  DeVore  discusses  the  degree  of  approximation  by  polynomials, 
trigonometric  polynomials,  and  splines,  obtaining  the  results  from  a relation  between  the  moduli  of 
smoothness  and  Peetre’s  K -functionals.  A.  G.  VituSkin  (from  the  Steklov  Mathematical  Institute  in 
Moscow)  has  a new  point  of  view  upon  entropy;  he  indicates  its  applications  to  coding.  G.  P.  Nevai 
describes  recent  results  (by  himself  , G.  Freud  and  others)  in  the  theory  of  convergence  of  interpo- 
lation formulas  and  of  orthogonal  polynomials.  C.  K.  Chui  surveys  Pade  approximations  of  arbi- 
trary analytic  functions.  L.  L.  Schumaker  discusses  fitting  surfaces;  it  is  hoped  that  this  exposi- 
tion will  be  useful  for  practical  computation.  The  shorter  research  papers  cover  a wide  variety  of 
topics  from  the  extremal  properties  of  polynomials  by  P.  Erdos  to  approximation  of  fixed  points 
by  L.  Collatz.  Several  papers  are  devoted  to  spline  theory  and  to  the  Birklioff  interpolation  prob- 
lem. 

Our  thanks  are  due  to  the  National  Science  Foundation  and  the  Air  Force  Office  of  Scien- 
tific Research  for  grants  which  made  the  symposium  possible.  The  latter  organization  provided  sup- 
port for  applied  papers  (Numerical  Analysis,  Finite  Element  Methods).  We  would  also  like  to  thank 
the  University  of  Texas  at  Austin  for  providing  secretarial  help  and  the  facilities  of  the  Thompson 
Conference  Center. 

C.K.  Chui,  G.G.  Lorentz,  L.L.  Schumaker 
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SPLINES  AS  LINEAR  COMBINATIONS  OF  B- SPLINES.  A SURVEY 


Carl  de  Boor 

This  paper  is  intended  to  serve  as  a postscript  to  the 
fundamental  1966  paper  by  Curry  and  Schoenberg  on  B-splines. 

It  is  also  intended  to  promote  the  point  of  view  that  B-splines 
are  truly  basic  splines:  B-splines  express  the  essentially  lo- 

cal, but  not  completely  local,  character  of  splines;  certain 
facts  about  splines  take  on  their  most  striking  form  when  put 
into  B-spline  terms,  and  many  theorems  about  splines  are  most 
easily  proved  with  the  aid  of  B-splines;  the  computational  de- 
termination of  a specific  spline  from  some  information  about 
it  is  usually  facilitated  when  B-splines  are  used  in  its  con- 
struction . 

1 . Introduction 

The  layout  of  the  survey  is  as  follows.  After  a short 
discussion  of  cardinal  B-splines,  i.e.,  of  B-splines  on  a uni- 
form knot  sequence,  in  Section  2,  B-splines  for  an  arbitrary 
knot  sequence  are  introduced  in  Section  3 and  shown  to  be  a 
basis  for  certain  spaces  of  piecewise  polynomial  functions. 
Various  simple  properties  of  B-splines  are  listed  in  Section  4, 
and  the  relationship  between  a spline  and  its  coordinates  with 
respect  to  a B-spline  basis  is  explored  in  Section  5.  This 
leads  naturally  into  the  discussion  of  local  spline  approxima- 
tion schemes,  in  Section  6.  Results  concerning  existence  and 
uniqueness  of  interpolating  splines  and  the  related  total  posi- 
tivity and  variation  diminishing  properties  of  B-splines  are 
presented  in  Section  7.  Section  8 describes  the  connection 
between  splines  and  certain  "best"  interpolation  schemes.  Fi- 
nally, Section  9 is  devoted  to  generalized  B-splines  and  ends 
with  a new  definition  of  polynomial  B-splines  in  many  variables 
due  to  I.  J.  Schoenberg. 
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CARL  deBOOR 


No  claim  of  completeness  is  made,  and  the  author  would  be 
grateful  to  hear  of  any  omissions. 

The  following  notation  is  used  throughout  the  paper,  usu- 
ally without  further  explanation: 

7L  denotes  the  set  of  integers,  R the  set  of  real  numbers, 


. B 


Thus,  R 


7L 


is 


and  A~  the  set  of  functions  on  B into 
the  set  of  real  bi-infinite  sequences. 

m(B)  is  the  linear  space  of  bounded  real  functions  on  B, 
normed  by  if 


= ,B 


sup  jf(x)|.  For  1 < p < oo,  IL  (I)  de- 

X€  B p 


notes  the  space  of  (equivalence  classes  of)  functions  f on 
the  interval  I for  which 


Ell  =-  t|£|'  If  |p)1/p<  «• 

P t'  > L 1 


C (I)  is  the  space  of  k times  continuously  differentiable 

k k- 1 

functions  on  1.  IL  (I)  is  the  subspace  of  those  f e C (1) 

P 

whose  (k-l)st  derivative  is  absolutely  continuous  and  whose 

ktfl  derivative  is  in  IL  (I).  M^(I)  is  the  subspace  of  ^(1) 

P nd 

whose  elements  have  an  absolutely  continuous  (k-2)  derivative 
and  a (k-l)st  derivative  of  bounded  variation.  Finally,  l (TL) 

:=  {a  c R2  | ||ai|  :=  (/..  |a.  |p)1/p  < «}. 

p ii 

P^  denotes  the  linear  space  of  all  polynomials  of  order 

k (or,  degree  < k)  with  real  coefficients.  For  a strictly 

increasing  sequence  £ *.=  (| . ) , IP.  denotes  the  linear  space 

“ 1 K , J 

of  all  piecewise  polynomial  (or,  pp)  functions  of  order  k on 
I :=  [inf  | . , sup  |^]  with  breakpoint  sequence  £.  Explicitly, 

In  addi- 


f . F iff 


(!,.! 


i+1 


) ' Fk  l(?.  ,5 


i’H+l 


)’ 


all 


i . 


tion,  f c P.  is  taken  to  have  two  values  at  | , i.e.,  the 

'1 

values  f(|^)  and  f(f^).  If  the  reader  finds  it  necessary 
to  think  of  f as  a single-valued  function,  he  should  choose 
some  rule  f(|i>  :=  Ctf^)  + (l-a)f(E^)  (e.g.,  a - 1/2)  and 

stick  with  it. 


If  v = (v^)  is  a sequence  of  nonnegative  integers 
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corresponding  to  E,  then  P,  denotes  the  linear  subspace 

v k,|,v 

of  F consisting  of  those  f e F,  for  which 
k,| 

jump  | f^V^  = 0 for  v < v^,  all  i. 


The  v-th  derivative  of  f 
well  as  by  f^V\  [x0>...,Tk]f 
difference  of  f at  the  points 


v 

is  also  denoted  by  D f as 
stands  for  the  k-th  divided 

T0,’"',Tk‘  In  Particu^ar» 


const  denotes  a constant  which  may  depend  on  the 

a, ...  ,oo 

quantities  Ot,...,oo. 


2 . Cardinal  Splines 

B-splines  made  their  first  appearance  in  Schoenberg’s 
1946  paper  on  the  approximation  cf  equidistant  data  by  analytic 
functions.  There  is  no  doubt  that  B-splines  appear  in  earlier 
literature.  They  play  a prominent  role  already  in  Favard's 
work  (35],  and  Schoenberg  has  always  maintained  that  they  were 
already  known  to  Laplace  (see  (70,  p.  68]).  But  it  is  in 
Schoenberg's  paper  that  they  were  thought  important  enough  to 
be  given  a name,  "basic  k^-order  spline  curves."  Since  this 
is  the  same  paper  in  which  Schoenberg  introduces  splines,  I 
happily  conclude  that  B-splines  were  there  at  the  very  begin- 
ning. 

Schoenberg  introduces  the  B-spline,  nee  basic  spline  curve, 
alias  spline  frequency  function  [29]  alias  fundamental  spline 
function  [71,  30] 

, x ..  , v . 1 r°°  I sin  u/2\k  iux  , 

(2.1)  Vx>  jjr  / [-J75— j e du 


and  then  observes  that 

(2.2)  M^x)  = k[-k/2,  l-k/2,  ...,  k/2]  (•  - x)*’1, 

i.e.,  M,  (x)  is  k times  the  k-th  divided  difference  in  y at 
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the  k+1  points  j - k/2,  j = 0,...,k,  of  the  function 

k-1 


k- 1 i 

(y-x)  + (max  {0,y-x})  ' ",  These  formulae  show  that 

the  k- th  convolution  power  of  the  characteristic  function  of 
the  interval  [-1/2, 1/2], 


is 


Mj(x) 


(2.3) 


1,  for  x e (-1/2, 1/2) 
0,  for  x i [-1/2, 1/2] 


M^(x)  = (M^*M.)(x)  = / (x-y)  (y)  dy  for  i+j  = k. 


Therefore, --and  this  is  why  Laplace  must  have  known  B-splines-- 
is  the  density  distribution  of  the  error  committed  on  the 
sum  of  k independent  real  random  variables  if  each  variable 
is  replaced  by  its  nearest  integer  value  [70,  p.  76]. 

It  is  easily  seen  from  (2.2)  or  (2.3)  that 


Fk,Z+k/2  r< 


„k-2 


set  of  "spline  curves  of  order  k" 


as  Schoenberg  calls  them.  The  subject  matter  of  the  paper  [ 70] 

is  the  study  of  approximations  of  the  form 

Af  :=  Z f (n)  L(*  -n)  , 
neZ 


and  the  B-splines  come  in  because  they  offer  a convenient  way 
of  expressing,  and  thereby  analyzing,  the  various  pp  "basic" 
functions  L considered  in  the  paper. 

In  the  60's,  Schoenberg's  results  were  rediscovered  and 
considerably  extended  by  those  engaged  in  studying  the  mathe- 
matical aspects  of  the  finite  element  method  (see  Aubin  [1,2], 
Babu^ka  [3],  Bramble  and  Hilbert  [19],  Fix  and  Strang  [38] 
and  Strang  and  Fix  [80],  Di  Guglielmo  [33],  and  others).  When 
restricted  to  the  one-dimensional  setting  of  Schoenberg's  papei; 
these  people  are  seen  to  consider  approximation  processes  of 
the  form 
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Af  :=  Z (Af(-  +n))L(-  -n) 
neZ 

for  some  convenient  basic  function  L,  e.g.,  L = M^,  and  some 
linear  functional  A e C*(IR)  , and  to  study  the  convergence  be- 
havior of 

Ah  :=  Sl/hASh'  With  (Saf)  M :=  f^a:x^ 


as  h ->  0 . The  results  of  this  study  are  nicely  summarized  by 
Link  [57]. 

Schoenberg  himself  developed  a particular  aspect  of  his 
'46  paper,  viz.  Cardinal  spline  interpolation,  in  considerable 
detail  in  a sequence  of  seven  papers  which  appeared  in  the  late 
60 ' s and  early  70 's.  These  papers  have  become  the  basis  for 
his  beautiful  monograph  [76]  on  cardinal  spline  interpolation. 
Readers  interested  in  the  properties  and  use  of  B-splines  on 
uniform  knot  sequences  are  urged  to  consult  that  monograph. 

3 . B-splines  Defined 


It  was  apparently  Schoenberg's  colleague  H.  B.  Curry  who 
observed  that  the  formulation  (2.2)  of  as  a k-th  order 

difference  generalizes  naturally  to  a k-th  order  divided  dif- 
ference on  arbitrary  points, 

(3.1)  Mi  k(x)  :=  kft^  . . .,  t1+kJ  (•  -x)^  l. 


The  resulting  paper  [30],  though  written  in  1946  (see  [29]),  was 

finally  published  in  1966.  The  function  M is  easily  seen 

i,  k 

to  be  a pp  function  of  order  k with  breakpoints  t 


V 


* ti+k> 

and  with  smoothness  across  each  breakpoint  t^  which  depends 
on  its  multiplicity,  i.e.,  on  the  frequency  with  which  the  num- 
ber tj  occurs  in  the  sequence  t^. Further,  one 
readily  sees  that 


(3.2)  ^(x)  >0  with  equality  if  x i (t^,  ti+fc) 
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in  case  < 


: t.  , . 
- i+k 


Now  let  t :=  (t.) 

= '1-00 


be  nondecreasing,  with 


C-oo  S=  lnf  V Coo  :=  SUP  V 


and  let  (M.  . ) °° 

N 1,  k -oo 

the  prescription 


be  the  corresponding  B- spline  sequence.  Then, 


(Zi°[iM  ) 00  ;=  ZiaiMi,k(x^  i-e •>  pointwise, 

makes  sense  for  all  x e 1 and  all  a e 1R  since,  by  (3.2), 
at  most  k of  the  terms  in  the  second  sum  are  nonzero  for  any 
given  x. 

In  a later  publication  [73]  (but  see  already  Curry's  re- 
view [28]  of  [70]),  Schoenberg  gave  these  functions  M.  , the 
name  basic  spline,  or  B- spline,  for  the  following  reason. 


THEOREM  3.1  [30].  If  t : = (t  ) ^ is  nondecreasing,  with 

t.  < t.  , and  d,  :=  card  fi  | t.  = t.],  all  i,  then  the  corre- 
i i+k  i J i 


sponding  sequence  (M^  ) of  B-splines  is  a basis  for  the 

linear  space  «&  of  all  functions  f on  ]R  which  vanish 

K,  t 

off  (t  , t ) and  which,  on  (t  , t ),  satisfy 

-00  00  - 00  00  


f 

all  i, 


i'  ti+l)  ’ 


jump  f 0 for  r < k-d^ 
ti 


in  the  sense  that  the  map 
and  onto. 


JL 


S, 


k,t' 


a »->  £,a,M,  . 

i i i,k 


is  one- one 


This  theorem  motivates  the  definition 

V !-  'Wi.k  l“i eK'  *u 

for  arbitrary  nondecreasing  t,  bi-infinite  or  not,  with  the 
sum  taken  over  all  i for  which  (t^, . . . , t ) is  a segment 
of  t.  In  particular, 
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n+k 


if  t = (ti)1  , then  ^ = < 


n 

£ aiMi  k I a 6 *“ 
i=i  *■> 


Further,  we  will  call  t the  collection  of  (polynomial) 
splines  of  order  k with  knot  sequence  t . 

COROLLARY  (Construction  of  a B- spline  basis  for  P,  ) . Let 

k>  = 

| :=  (|^)^+*  be  strictly  increasing,  v :=  (v^) ^ be  a cor- 
responding sequence  of  integers  in  [0,k]  with  v^  = v^+^  = 0, 

and  let  IP.  be  the  space  of  pp  functions  of  order  k on 

k,J,v  c — 

fil>5p+lJ  with  breakpoints  and  continuous  v-th 

derivative  at  for  v < v^,  all  i.  If 


n+k 


t (ti)1 


then  n = k + ^(k-v^)  and  the  sequence  (M^  ” of  B-spllnes 

(restricted  to  U^>£p+^])  of  order  k for  the  knot  sequence 

t is  a basis  for  P,  „ 

= k>l,Y 


For  k even,  k = 2m,  it  is  customary  to  single  out  the 
subspace  S of  so-called  "natural"  splines  in  P.  , . This 

k>|>Y 

subspace  consists  of  those  f in  F, 


and  f 


k,£>Y 


for  which  f 


(S^Sj) 


( 


are  both  of  degree  < m (see  Section  8 below 


(8.8)).  Greville  [44]  has  described  the  following  B-spline 
like  basis  for  S, 

*m+l,k*  ’ * * *^k,  k***k+l,  k*  * * ’ *^n-k,  k'^n-k+l,^  ’ * ‘ ’^n-m,  k 

with  the  special  functions  ft  , defined  as  follows: 

i,  k 
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\,k(x)  :=  Mtk+i'-"'ti+kJ<"’!>t'1  £or  1 S k 

(3.3) 

Mn_i  R(x)  :=  (-l)kk[tn_1,...,tnJ(x-.)^"1  for  i < k. 

For  a different  generalization  of  to  a "B-spline" 

with  multiple  knots  (which  are  otherwise  uniformly  spaced) , see 
Schoenberg  and  Sharma  [77]  and  Lecture  5 of  Schoenberg's  mono- 
graph [76].  Certain  technical  assumptions  made  by  them  in 
their  construction  have  recently  been  removed  by  Lee  [56], 


4 . Simple  Properties  of  the  B-spline 

In  this  section,  we  list  some  simple  properties  of  the  B- 

spline,  some  of  which  are  enlarged  upon  in  subsequent  sections. 

The  definition  of  M as  a divided  difference  together  with 

I,  k 

Taylor's  formula  with  integral  remainder  readily  imply  that, 
for  t,  < tj 


'i+k* 


00 


(4.1)  [ti,  ..  .,t1+k]f  = / M^k(s)  f (s)  ds/k! 


all  feljlt^g 


In  particular, 

(4.2)  / k ( s)  ds  = 1. 

This  shows  that,  on  [t,,t.  ], 

’ V i+k 

x 

(4.3)  0(x)  k(s)ds 


is  a spline  of  order  k+1  with  knots  t,....,t.  , and  rises 

strictly  monotonely  from  a value  of  0 at  t^  (and  to  left  of 

t ) to  a value  of  1 at  t,  . (and  to  the  right  of  t ) . 

1.  1.  "He  1 "f  R 

This  function  is  therefore  useful  in  constructing  piecewise 
monotone  spline  interpolants  as  is  done  in  Passow  [66],  but 
without  having  to  resort  to  multiple  knots  as  he  does.  One 
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obtains  his  construction  as  a special  case  by  letting  half  the 
t.'s  equal  t^  and  the  other  half  equal  t^+k<  Use  0 also 
produces  a very  quick  proof  that  splines  have  property  SAIN 
with  respect  to  interpolation  at  a given  set  of  points  and  the 
uniform  norm  (see  Chui,  Rozema,  Smith,  and  Ward  [24],  who  use 
(4.3)  in  the  form  (4.11)).  Because  of  its  local  and  monotone 
character,  0 has  also  been  instrumental  in  DeVore's  successful 
investigation  [32]  of  the  order  of  approximation  to  smooth  mono- 
tone functions  by  monotone  splines. 

It  seems  more  convenient  in  computations  to  use  the  nor- 
malized B-spline 


(4.4)  N^k(x)  : ( ^i+i*  * * '>  Ci+k^  ' ^t’  * * ’*  ti+k-l^  ^’”X^ 

= <Wti)Mi,k(x)/k' 


k-1 

+ 


since  it  insures  (see  (5.8)  below)  that 


(4'5)  J/V  “ 1 on  "k’W- 


Note  that  then 


(4,6)  Ni,k  Mi,k-1  ' Mi+l,k-l 


k-1 


k-1 


’i+k 


tt  “i,k-l 


N. 


t.  ,-t.  , ”i+l, k-1* 
i+k  i+1  * 


If  one  follows  [8]  and  applies  Leibniz'  formula 


(4.7)  [ 


j 


t^,  . . . , t ] (fg)  - l_,  [t^,...,t^]f  [i^>  • • •> 

J r=i 


Jg 


for  the  divided  difference  of  a product  to 
(•  -x)  } = (•  -x)|  1 (*  -x) 

and  notes  that  all  divided  differences  of  (r  -x)  of  order  > 1 
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vanish,  then  one  obtains  the  recurrence  relation 
(4.8)  [ti,...,t1+k](.-x)j  = 


— [t  t ] + 1— ft 

t,  , -t.  '•*,ti+k-lJ  - -- 


x - t 
:i+k_ti 

which  in  turn  implies  that 


WV'1^ 


,,t:i+k] 


(•-*)+  1> 


<*-9>  i^rNij)k(x)  =r — rrNijU(x)  +~r^i — *iii,k-i(x) 

1,k  i+k-1  i ’ i+k  i+1  * 


For  j = 0,  this  recurrence  was  found  by  the  author  [8]  and  by 

L.  Mansfield,  and  by  Cox  [27]  who  proved  it  by  a different 

argument  and  for  distinct  knots  only,  and  gave  a backward  error 

analysis  in  that  case  for  the  evaluation  algorithm  based  on  the 

recurrence.  The  recurrence  provides  a scheme  for  the  stable 

evaluation  of  B-splines  since,  on  the  interval  (t^, t ^)  °f  *-n_ 

terest,  i.e.,  on  the  support  of  N.  , , both  weights  in  (4.9) 

i,  K 

are  positive.  This  observation  also  allows  us  to  establish,  by 
induction  on  k,  that 

(4.i0)  Nlk>0  on 

Similar  recurrence  relations  for  the  integral  of  a B- spline 
have  been  given  by  Gaffney  [39],  and  for  the  integral  of  pro- 
ducts of  B-splines  by  Lyche,  Schumaker,  and  the  author  [17]. 

In  this  connection,  we  note  that 

x i+r 

(4-u>  ! Ml,k(8)dS  - ,2Ak+iw  £or  x — 'i+r+l * 

-00  ’ j=i 

B-splines  are  convenient  for  relating  splines  with  multi- 
ple knots  to  splines  with  simple  knots  and  vice  versa  (e.g., 

[7],  Rice  [68],  Burchard  [23],  also  the  paper  by  P.  Smith  in 
these  proceedings),  since  a B-spline  is  a continuous  function 
of  its  knots,  within  reason.  Specifically,  writing 
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V *"'ti+k 


: = n 


i,k 


to  stress  the  dependence  of  ^ on  its  knots  t^, . . . , t^+^> 
the  map 

(t  ,)k  »-»  N 
j 0 


is  continuous  as  a map  from  {t  e +1lTo  - •••  - V 

t < t } to  COR):  it  is  also  continuous  as  a map  from 

j j+k-1  — ‘ 

{ T e Fk+  1 | Tq  < . . . < xQ  < Tk)  to  ]Lp (JO  for  every  1 < p < oo. 

The  precise  behavior  of  N near  the  boundary  of  its 

i,  k 

support  can  be  read  off  directly  from  its  definition  as  a di- 
vided difference.  Since 

k-1  , .k  .k-1  .k-1 

(y-x)+  - (-)  (x-y)  + = (y-x)  , 

and  the  k-th  divided  difference  of  a polynomial  of  order  k 

vanishes,  one  can  write  N , also  in  the  form 

i,  K 

(4.U)  »1)kW-Hk([tltl tltkI-[v...,«ltk.1lH*-,1:'1, 

From  this,  one  infers  at  once  that,  e.g.,  for  x near  t^, 

k-r+L 


(4.13) 


k”  r 

..  / \ . ^ . k-  r -rr  k-j  ,,  .K-r+i. 

N (x)  » <*-t  > + TT  J_,  + 0( (x- 1 ) + ) 

* J=1  i+k-j  i 


if  t.  = t,  . < t.  , hence 
i i+r-1  i+r' 


(4.14)  N.  (x)  = 

If  t,  < ...  < t 


0((x-t  ) N (x)) 


r i,k 


for  j > i as  x -»  t 


0((x-ti+k)Ni,k(,0)  for)<1  *s  X-W 


. , , then  N.  , has  a zero  of  order  k-1  at 
i+k*  i,k 

by  (4.13  and  also  a k-1  fold  zero  at  t^+^  by  symmetry.  This 
implies  that 
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(4.15)  0 = N 


(j-r) 

i,k 


'i+k 


<tj+k)  ‘ { 


(ti+k-s>r'MA(s)ds/(r-1):' 

r = l,...,j)  j = l^...^k~l. 


,(j) 


showing  that  N is  orthogonal  to  P.  on  [t^,  j = 1, 

(This  fact  was  pointed  out  to  me  in  1973  by  H.  G. 

Burcha  rd .) 

5 . The  B-spline  Series 
In  this  section  the  relationship 


(5.1) 


z.a.N.  «-»a 

iii 


between  a spline  and  the  sequence  of  its  B-spline  coefficients 
(with  respect  to  the  normalized  B-splines)  is  discussed.  Fur- 
ther aspects  of  this  relationship  will  be  mentioned  in  subse- 
quent sections.  From  here  on,  we  suppress  the  subscript  k 
in  N.  , and  M.  except  when  necessary.  Also,  we  restrict 

1 y K 1 y K 

the  knot  sequence  t to  be  bi-infinite  in  order  to  avoid 
(mostly  notational)  complications.  This  is  no  essential  re- 
striction since  any  spline  can  always  be  extended  to  a spline 
with  a bi-infinite  knot  sequence  merely  by  adding  to  its  expan- 
sion appropriate  B-splines  with  zero  coefficients. 

A B-spline  series  may  be  differentiated  by  differencing 
the  coefficients.  Precisely,  repeated  application  of  (4.6) 
gives 


(5 . 2a) 
with 


(j) 


j 


r 


(5.2b)  a 


(j) 


a. 


wv/(kri ' 


j = 0 


j >0 


The  recurrence  relation  (4.9)  (with  j = 0)  allows  one  to 
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express  a B- spline  series  as  a series  of  lower  order,  but  with 
polynomial  coefficients.  Precisely, 


(5.3a)  Z^N^Cx)  = ^;JJ(x)N.a..(x) 


(5.3b)  : = 


, j 0 


(x-ti)a{j-1J(x)+(ti+k  -x)a|j;1]  (x) 


t,  , . - t. 

i+k-j  l 


, j >0. 


In  particula 

[k-1] 
r,  al± 

is 

agrees  with 

2iaiNi,k 

on 

to  evaluate 

2iVi,k 

at 

is  a polynomial  of  degree  < k which 


of  averages,  starting  with  the  k numbers  Oi.  , ...,Q£.  (see 
the  first  algorithm  in  [8  J)  . 

The  quasi-interpolant  of  Fix  and  the  author  [16]  provides 
an  ofttimes  convenient  means  for  computing  the  B-spline  coeffi- 
cients of  a given  spline.  The  quasi-interpolant  makes  use  of 
the  linear  functional  given  by  the  rule 

<5'4)  V V,*  kf 

i'  i,k  j < k ’ 


Here, 


V00  (ti+rx)'---'(ti+k-rx)/(k-l): 


and  t,  is  an  arbitrary  point  in  (t.,t.  ,).  Then,  as  one 
verifies  directly  [16], 

(5.5)  A N . = 5.  .,  all  j. 
i j i,  y 

Since  has  support  at  a point  only,  it  follows  that 

A.  (E.a.N.)  = aJ . 

1 J J J 1 

The  usefulness  of  this  functional  was  demonstrated  in  [9], 
For  instance,  it  provides  a quick  proof  of  Theorem  3.1  and  its 
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corollary.  As  another  instance,  it  provides  a quick  proof  of 
the  fact  due  to  Curry  and  Schoenberg  [30]  that  B-splines  are 
splines  of  minimal  support:  If  £ e S,  has  its  support  in 

(t  ,t  ) and  s < k,  then,  for  each  i,  one  can  choose  t. 
r’  r+s  ’ ’ ’ i 

in  ^i' ti+k^tr,tr+s*'  hence  then  \f  = °>  a11  L> 
f = 0. 

More  generally,  one  obtains 


LEMMA  5.1.  ^f  t^  < t^+k,  all  i,  then  supp = 

U supp  N. . 

d^O  1 


In  order  to  compute  the  coefficients  of  specific  splines, 
we  observe  that,  for  f,  i|r  e I>k,  a(r)  :=  A^  ^f  is  constant  as 

a function  of  t,  as  is  clear  from  the  fact  that  CK ' (t)  = 
iKr)f^(T)  - (")  (t)  f (t)  . Hence,  with  t = y,  we  get  that 


\(y-*)k  1 = Ay^  (y- •) k 1 = ^k(y)  (-) k_  1 (k- 1) ! . 
i,k 


This  shows  that 

(5.6)  (y-x)k_1  = 21(y-t1+1)-...-(y-titk.1)NlkW, 

which  is  Marsden's  identity  [61],  More  generally, 

\(y-Ok’P/(k-p)  1 = (-)P"1^^Pk1)  (y)  (-)k"P, 
so 

(5.7)  (y-x)k"p/(k-p)!  = (-)k'V51)(y)Ni,k(x)' 

and,  in  particular,  with  p = k, 

(5.8,  1 ■ ¥(>k. 

Of  course,  all  these  identities  hold  on  (t  , t ) only. 

— ocr  oo 

obtains  similarly  that 


One 
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(5.9)  (y-x)^"1  = Zi(y-ti+1)  (y_ti^k-l)+Ni,k(x)'  foryeI- 

For  the  uniform  knot  sequence  t -7L  and  for  k = 4,  one  can 
find  (5.6)  and  (5.9)  already  in  Schoenberg  [70]. 

I 

’ Identities  (5.6)  and  (5.9)  illustrate  a point  to  be  made 

repeatedly  in  this  survey,  viz  how  closely  a spline  function  is 
modelled  by  its  B-spline  coefficients.  To  elaborate  on  this 
point  a little,  note  that,  with  t : = (t^)q  anY  subsequence  of 
► t,  (5.9)  implies  that 

(5.10a)  k[T0,...,rk](.-x)1J'1  = ZiaT(i)M^k(x) 

where 

(5.10b)  c^(i)  :=  [tq,  ■ . TkJ  (■'ti+1)+‘  • • • ’ ^’"ti+k-l^  + 

> 0 

This  supplies  the  formula 

(5.11)  [Tq,  * ’ * ’ = • • • > ^ 

for  the  k-th  divided  difference  at  some  points  in  terms  of  the 

k-th  divided  differences  at  the  points  of  a refinement  of  those 

points,  with  the  coefficients  nonnegative.  The  existence  of 

such  a formula  with  nonnegative  weights  Ot was  already  known 

to  Favard  [35].  The  formula  is  clearly  a discrete  analog  of 

(4.1),  and  a deserves  to  be  called  a discrete  B-spllne  with 

knots  T.  Indeed,  has  been  called  just  that  by  Schumaker 

[79]  in  the  special  case  when  t is  uniform,  t^  = t^  + ih, 

all  i.  In  that  case,  if  f e A has  only  the  active  knots 

k,  t 

t^  , . . . , ti  and  f = E a (i)N  k,  then  0^  is  a discrete 
*0  r * 

spline  of  order  k with  knots  in  the  sense  of 

Mangasarian  and  Schumaker  [60],  This  means  that,  for  each  j, 

af (i)  is  a polynomial  of  order  k in  i on  i^-k<i<ij+^. 

It  should  be  said,  though,  that  Mangasarian  and  Schumaker  did 
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not  view  discrete  splines  in  this  light  as  B-spline  coefficients 
of  continuous  splines.  They  arrived  at  discrete  splines  as  the 
solution  of  certain  discrete  minimization  problems. 

The  size  of  the  i-th  B-spline  coefficient  of  a spline  is 
closely  tied  (at  least  for  moderate  k)  to  the  size  of  that 
spline  "nearby,"  i.e.,  on  (t.,t.  ),  as  can  be  proved  [9]  with 

1.  1 ■*'K 

the  aid  of  the  linear  functional  (5.4).  Slightly  more  refined 
arguments  produce  the  following  explicit  result. 


THEOREM  5.1  [13] . 
property  that  for 


Let  D, 
k 

eve  ry  t, 


i+k-1 


be  the  smallest  number  with  the 
eve  ry  i,  and  every  a < b with 


there  exists  h.  e IL  such  that 

i oo  

(5.12)  supp  h.c  [a,b],  hj[  < Dk/(b-a),  /h^.  = 6i.,  all  j. 

Then  (tt/2)  k/2  < DR  < 2k  9k'L. 

Numerical  evidence  presented  in  [13]  strongly  suggests 

that  actually  D.  ~ 2 . 

k 

The  theorem  implies  that 


(5 , 


1.  X.  tK 

which  leads  to 

THEOREM  5.2  [9].  Let  E be  the  diagonal  matrix  ]"•••> 
(t^^-t  )/k,  ...J.  Then 


D“1||E1/pa||p  < Hz^N^i  < i|E1/pC(!|p,  allae^,  l<p<«  . 

In  particular,  Z^Ct^N^  e ]Lp  if  and  only  if  E ^Pa  e £^(2Z)  . 

The  proof  of  the  upper  bound  for  ||Z^a^N^||  makes  use  of 
the  fact  that  the  N^'s  are  nonnegative  and  sum  up  to  1 while, 
by  (4.2)  and  (4.4),  / ^ = (t^- t^)  /k. 


l/P, 


JL 
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COROLLARY  1 [9].  Let  k p :=  <k/ (ti+k‘ Ci^ 1 Pwt  k’  — 

1 < p < «,  (N.  ) is  a Schauder  basis  for  A fi  IL  (JR)  . 

i,k,p  p 

We  note  the  estimates 

(5.14)  k1/p/k  < N < 1. 

f > p p 

COROLLARY  2 [12],  Let  t be  finite,  infinite,  or  bi-infinite, 
let  G :=  (/ N.  , ,N.  J,  and  let  G_1  = (a..).  Then  G_1 

decays  exponentially  away  from  the  diagonal.  Explicitly, 

ja  ! < const  q ^ 1 

with  q = (l-D^)1^^  ^ e (0,1)  and  const  = D^/q^  1 both 
depending  only  on  k and  not  on  t. 

This  corollary  was  proved  earlier  for  a finite  uniform  t 

by  Domsta  [34],  and  then  used  by  Cielsielsky  and  Domsta  [26 J 

k-  2 d 

in  the  construction  of  a basis  for  C [0,1]  which  is,  at 

k-  2 d 

the  same  time,  also  a basis  for  1L^  [0,1]  for  l < p < «=. 

The  corollary  was  used  in  [12]  for  a somewhat  related  purpose, 

viz  in  order  to  show  that  least- squares  approximation  from 

S,  , considered  as  a map  on  ]L  , can  be  bounded  in  terms  of 
k,  t*  p' 

the  global  mesh  ratio 

M :=  sup.  .(t.  -t,)/(t.  ,-t.). 

t *i,  j'  i+k  V v j+k  j 


COROLLARY  3 [7],  [13].  Let  m^  :=  fl  mQR)  be  the  sub- 

space of  bounded  splines  of  order  k with  knot  sequence  t. 
Then  the  rule  0£  #->  Z^CC^N^  maps  £ (2Z)  onto  m«9^  . Further, 

with  0:  Z,r(2Z)  -»  m^  t:  a i->  Z^a^N^,  the  condition  (number) 

cond  :=  ||0  ||0  V of  the  basis  (N  ) for  m«&  is  bounded 

Kj  L 1.  K j L 

by  independent  of  t . 
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Since  (D^ D^D^, D^, . . .)  < (1,  2.5,  5.3,  10.1,  ...),  thia 
shows  the  B-spline  basis  to  be  well  conditioned,  independent  of 
t,  for  "small"  k. 

Finally,  for  another  illustration  of  the  fact  that  B-spline 
coefficients  "model"  the  function  they  represent,  observe  that, 
for  the  particular  choice 


(5.15)  t t - Tt  :=  (t1+1  + 


the  coefficient  of  f 

V = f(Ti)  + bt 


(1) 


(Ti} 


+ '!*-!> /<k-l>' 


in  (5.4)  vanishes.  Then 


with 


j=2  LK 


j)  (Tj)f(j)(T*) 


< const,  (max  At  ) 2 max  Ilf  ^ II  . 
k r 2§j<k 

Therefore,  if,  e.g.,  f is  a fixed  spline  with  Ilf  ^ ||  < oo 

"oo 

for  2 < i < k,  and  we  write  f as  a linear  combination  of  B- 
splines  on  a knot  sequence  t which  refines  the  knot  sequence 
for  f,  then  the  resulting  B-spline  coefficient  sequence  OL  for 
f satisfies 


= f(T*)  + 0(max  (Atr)2). 


6.  Local  Spline  Approximation 

Because  of  their  local  support,  B- splines  have  been  in- 
strumental in  the  construction  of  local  spline  interpolation 
and  approximation  schemes.  In  such  a scheme,  the  approximation 
is  taken  in  the  form 


(6.1)  Af  :=  £i(u1f)Ni 

with  p a linear  functional  with  support  in  supp  N = (t  , t .). 

1 11  ' Iv 
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Since  then  (Af)  I.  „ * depends  only  on  f I 

,(tj'tj+l)  l(tj+l-k'VkJ 

such  an  approximation  scheme  is  capable  of  reflecting,  and  tak- 
ing advantage  of,  the  local  behavior  of  f. 


LEMMA  6.1.  If  A reproduces  PR  on  (t_^,  t ),  then 


(6.1) 


f - Af' 


(t  t s < (SUP,  ) dist-  (t  t \ 

(tj,tj+l)  1 1 * (tj+l-k>  tj+k) 


(f,V- 


4 


4 

4 


The  condition  that  A reproduce  is  certainly  satis- 

fied in  case  A is  a projector.  This  will  happen  iff  (^)  is 
dual  to  (Nt),  i.e.,  = 6^,  all  i,  j.  In  such  a case, 

Af  interpolates  f at  (n  ) in  the  sense  that  L^Af  = ^f, 
all  i.  A linear  functional  satisfying 

(6.2)  supp  ut  E SUPP  = (ct>  ti+k>  > a11  h 

seems  to  have  been  constructed  for  the  first  time  in  [5],  for 
the  purpose  of  demonstrating  the  linear  independence  over  an 
interval  of  all  B-splines  which  do  not  vanish  identically  on 
that  interval.  Since  then,  such  linear  functionals  have  been 
constructed  in  various  ways  and  for  a variety  of  jobs.  A sum- 
mary and  detailed  discussion  is  given  in  [13], 

The  first  local  spline  interpolation  scheme  seems  to  have 
been  Birkhoff's  local  spline  approximation  by  moments  [4],  A 
corrected  and  extended  version  can  be  found  in  [6],  The  scheme 
was  not  given  in  the  form  (6.1).  It  was  therefore  somewhat  of 
a surprise  to  find  that  local  spline  approximation  by  moments 
is  a special  case  of  the  quasi-interpolant  of  Fix  and  the  author 
[16],  i.e.,  of  the  form  (6.1)  with  4^  = given  by  (5.4) 

Wlth  Ti  = Ci+k/ 2’  311  1* 

The  quasi-interpolant  approximates  well  to  f and  its 
first  k-1  derivatives,  but  requires  values  of  f and  of  its 
derivatives  for  its  construction.  An  earlier  scheme  [7] 
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constructs  4^  involving  only  function  evaluations,  and  satis- 
fying even 


(6.3)  supp  (ci+1>  > ^iNj  = 6ij’  a11 


and  so  that  sup . ||m-^  I!  < <».  This  is  possible  since  it  can  be 
shown  that 

Dk  a,  :=  sup  SUP  |u  eC*fti+i'  ti+k-l^  pNi  = &ii'  311  ^ 

' °°  t i J J 

(6.4)  = sup  sup  1/dist  r i (N  , span(N  ) , ) 

t i [ti+l'WlJ  1 3 

is  finite.  In  fact,  it  follows  from  Theorem  5.1  that  D,  < D. 

k,  oo  — k 


< <m.  Therefore,  one  finds  that,  for  this  scheme, 

(t  t )(f^Pk) 

’ Uj+2-k'  j+k-l; 


(6.5)  | f-Af  , _ . < D dist 

11  °°,  (tj,tj+1)  “ k>°° 


But  it  is  not  clear  how  well  the  derivatives  of  Af  approximate 
those  of  f.  Also,  A is  not  applicable  to  arbitrary  felL^. 

The  latter  objection  can  be  overcome  by  choosing  4^  of 
the  form 


= /f  h., 

with  h,  e L [t  ,t.  , ] chosen  as  in  Theorem  5.1  to  satisfy 
1 00  1 i +k 

(5.12).  The  resulting  linear  projector  P, 

(6.6)  Pf 


is  local  and  is  bounded  as  a map  on  £,  by  D,  for  each 

p k 

p e [l,oo]  and  independently  of  t [11].  But,  in  order  to  ob- 
tain also  good  approximations  to  derivatives  (regardless  of  t, 
i.e.,  without  recourse  to  Markov's  inequality),  Lyche  and  Schu- 
maker  [59]  found  it  necessary  to  give  up  the  condition  that  Af 
interpolate  f and  to  revert  to  the  weaker  condition  that  A 
merely  reproduce  P^.  Such  local  approximation  schemes  have 
been  further  investigated  by  Demko  [31], 
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An  important  local  spline  approximation  scheme  (which  only 
reproduces  P^)  is  Schoenberg's  variation  diminishing  spline 
approximation.  It  will  be  discussed  in  the  next  section. 

The  use  of  local  spline  approximation  schemes  for  gauging 
accurately  the  degree  of  approximation  by  splines  is  further 
pursued  in  DeVore's  contribution  to  these  proceedings. 

We  close  this  section  with  the  remark  that  the  dual  to  the 
linear  projector  P in  (6.6),  i.e.,  the  linear  projector  P1 
given  by 

(6.7)  P'g  :=  Zi(/fN1)hi, 


is  helpful  in  settling  two  questions  of  "smooth"  interpolation. 
The  first,  raised  originally  by  Schoenberg  [74J  and  partially 
answered  by  Golomb  [42],  concerns  the  existence  of  g e ]L  (F) 


]R' 


7L 


and 


which  satisfies  g(t^)  = 0!^,  all  i,  for  a given  O.  e 

a given  t (t^)  taken  strictly  increasing  for  simplicity. 

Let  [t.,...,t.  . ]a  be  the  k- th  divided  difference  of  the  data 

at  t^, ...,t  k 3n<*  reca^  t^ie  diagonal  matrix  E : ]”..., 

(t.  ,-t  )/k, ...|  of  the  preceding  section.  Then  it  is  easily 
x+k  i + I 

seen  that  having  E vft.,...,t  , ])  in  & is  a necessary 

1.  1 p 

condition  for  the  existence  of  such  a g.  To  see  that  this  is 
also  a sufficient  condition,  observe  [13]  that  the  function  g, 
given  by  the  conditions  that  g(t^)  Oi^,  i l,...,k  and  that 

(6.8)  g(k)  = (k- 1)  ! Z1([t1,...,t1+k]oO(t1+k-t1)h1, 

is  in  ]Lk  by  Theorem  5.1  in  case  E^^(  [t t.  ])  e £ , 
p 1 ' i'  ’ i+k  p' 

and  agrees  with  a at  t since,  by  (4.2),  it  has  the  same 

k-th  divided  differences  at  the  points  of  t as  does  a. 

The  particular  interpolant  g to  the  given  data  a at  t 

just  constructed  has  the  property  that,  on  at  most 

k of  the  h^  in  (6.8)  are  not  zero,  while,  by  Theorem  5.1, 

||h  ||  (t  -t  ) S Di,>  all  This  proves  [13]  that,  for  given 

X ® X tK  X K 
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* 

t and  given  Ct,  there  exists  g e TL^  so  that  g l = a and, 

for  all  t . < t . - , 

J j+1 

b«L  rr  t i < const  max  kl|[t  , . . . , t | 

for  some  const  < D^.  This  answers  a question  by  H.-O.  Kreiss 
as  to  the  existence  and  the  size  of  such  a const. 


7 . Total  Positivity  and  the  Variation  Diminishing 
Properties  of  B-splines 

The  strict  positivity  of  N on  (t.,t.  ) (see  (4.10)) 

1.  y K X 1 +K. 

is  a particular  instance  of  the  Schoenberg-Whitney  theorem  and 
the  variation  diminishing  properties  of  B-splines,  the  subject 
of  this  section.  A thorough  discussion  of  these  matters  in 
the  more  general  context  of  Chebyshev  splines  can  be  found  in 
Chapter  10  of  Karlin's  book  on  total  positivity  [47]. 

Throughout  this  section,  the  knot  sequence  is  taken  to  be 
finite, 

n+k 

t = (t^)^  > nondecreasing  with  t^  < t +^,  all  i, 

and  (N^,)^  is  the  corresponding  sequence  of  B-splines  of  order 

k.  A has  then  dimension  n.  We  consider  spline  interpo- 
k,  t 

lation  at  points  < ...  < t^.  This  amounts  to  finding,  for 
given  f,  a e R so  that 
n 

(7.1)  £ a N (t  ) = f(T  ),  i = l,...,n. 

j=l  J J 1 1 


The  question  of  existence  and  uniqueness  of  such  an  interpolant 
was  settled  some  time  ago. 


THEOREM  7.1 
(7.2)  S : = 


(Schoenberg-Whitney  [78]).  Let 

k n . . 

< £ a xJ  + £ a,  (x- t •)  + 

j=l  J j=k+l  J J 


a e R 


n l 
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with  t,  ,<...<  t . If  T,  < ...  <T  , then  S contains, 
k+1  n — 1 n7  7 

for  arbitrary  f,  an  s such  that  s(t\)  = f(Tj,  i = l,...,n 
iff  Ti-k  < t < T , i = k+1,  . ..,  n. 

In  this  connection,  it  is  interesting  to  note  the  follow- 
ing theorem  published  with  an  elegant  proof  in  1939,  and  pointed 
out  to  me  by  Allan  Pinkus. 

THEOREM  (Krein  and  Finkelstein  [55]).  Let  G be  a Green's 
function  for  the  k-th  order  linear  differential  operator 


L = Zp.Dj 
3=0  i 

with  p.  e C[a,b],  all  j,  and  p never  zero  on  [a,b],  Spe- 
J X 

cifically,  assume  that  G is  of  the  form 


G(x, y)  = 


A 0 . (x)^  (y)  for  x > y, 
j=l  J J 


/.  0.(x)$  (y)  for  x < y, 
j = l J J 


with  both  ^nd  (3  J ^ linearly  independent  and  in  ker  L. 

/ X « • • • y X \ 

If  det  G 1 r)  >0  for  all  nondecreasing  (x,)f 

\yv  • • ->yrJ  f 1 


for  all  nondecreasing  (x^) ^ 


U r- 

1 

T3 

C 

CO 

then 

j.  j. 
det  G 

\yv---,yTt 

if  and  only 

Xi-P  < yi 

i = 1,  . . . , r 

-q. 

Since 

S,  as  defined 

ftk7 Cn+1]7 

it  is  possible 

ment  involving  B-splines  provided  we  make  the  assumption  that 
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(7.3) 


t c [ t.  • t . . 
1*  ’ n k'  n+1 


It  is  also  possible  to  prove  directly 

THEOREM  7.2.  If  t,  < ...  < T , then  (N.(t.))?  is  invertible 
— 1 n j i J.  

if  and  only  if  e supp  N , i.e. . N^(t^)  ^ 0>  all  i. 


In  other  words,  (N.(t^))  is  invertible  iff  its  diagonal 
is  invertible.  Burchard  [21,  Chap.  Ill,  2(3)]  and  Karlin  [47, 
Chap.  10,  Lemma  4.1]  both  prove  Theorem  7.2  explicitly  in  terms 
of  B-splines,  with  simple  knots,  but,  on  the  other  hand,  more 
generally  for  Chebyshev  splines. 

Karlin  and  Ziegler  [53]  remove  the  restriction  in  Theorem 
7,2  to  simple  knots.  They  also  allow  for  repeated  or  oscula- 
tory  interpolation  and  consider  Chebyshev  splines  rather  than 
just  polynomial  splines.  Straightforward  translation  of  their 
result  to  B-splines  would  require  assumption  (7.3). 

We  will  now  quit  belaboring  this  minor  point  and  state  the 
theorem  directly  in  terms  of  B-splines. 


THEOREM  7.3  (Karlin-Ziegler  [53]  extension  of  Schoenberg-Whit- 

ney) . Let  T.  < ...  < T be  such  that 
1 - — n 

t,  . = ...=t.  =t.  . =...=t.  implies  r + s < k, 

i+1  l+r  j+1  j+s  — c — ’ 

and  define  linear  functionals  ( p^)  ^ by  the  rule 

M^f  :=  f^  (t  ) with  j :=  max[r  I Ti_r  = t^). 

Then  (p^N^)  is  invertible  if  and  only  if  Ni(Ti)  ^0,  i=l,...,n. 

A simple  proof  of  this  theorem,  using  only  elementary  pro- 
perties of  B-splines  and  Rolle's  theorem,  can  be  found  in  [15], 
Theorem  7.3  states  conditions  under  which  it  is  possible 
to  interpolate  by  linear  combinations  of  all  B-splines  for  a 
given  knot  sequence.  A careful  study  of  Karlin's  proof  [47]  of 
the  total  positivity  of  (Nj(T^))  reveals  the  fact  that 
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Theorem  7.3  remains  valid  if  we  replace  the  sequence  (N.)  by 
one  of  its  subsequences. 

THEOREM  7.4  [15].  Under  the  same  assumptions  as  those  of  Theo- 
rem 7.3,  and  for  any  subsequence  (q^,...,^)  £f  (l,...,n), 

det(p^N^  )™  . j >0  with  equality  iff,  for  some  i,  (t^)  = 0. 


This  theorem  implies  at  once  the  total  positivity  of 


THEOREM  7.5  (Karlin  [47]).  Let  t < 


Then  (N.(t  )) 

is  totally  positive,  t.e,,  all  its  minors  are  nonnegative. 


< T . 
— n 


Karlin  [47,  p.  563]  states  that  this  theorem  was  communi- 
cated to  him  by  Schoenberg. 


COROLLARY. 


(N^)  is  a weak  Descartes  system,  i.e.,  any  subse- 
quence (N^  ) ^ £f  (N^) ^ is  a weak  Chebyshev  system. 

The  total  positivity  of  (N . (t  ) ) provides  bounds  on  the 
effect  of  rounding  errors  when  solving  (7.1)  by  Gauss  elimina- 
tion without  pivoting  which  are  smaller  than  those  obtainable 
for  general  matrices  even  when  using  pivoting  [18].  This  means 
that  it  is  reasonable  to  solve  the  banded  system  (7.1)  without 
pivoting  with  the  attendant  savings  in  storage  and  program  com- 
plexity. 

The  total  positivity  of  (N^(t^))  is  used  in  an  essential 
way  by  Karlin  and  Pinkus  [51]  in  their  extension  to  splines  and 
to  higher  derivatives  of  earlier  results  by  C.  Davis  and  Viden- 
ski  concerning  the  existence  of  a polynomial  of  degree  n on 
[0,1]  with  a prescribed  sequence  of  n+1  extrema. 

The  total  positivity  of  (N.(t  ))  leads  to  one  of  the 
more  striking  spline  approximation  schemes,  Schoenberg's 
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variation  diminishing  spline  approximation,  which  has  found 
much  use  in  computer-aided  design  (see,  e.g.,  Riesenfeld  [69]). 
We  recall  some  notation.  A real-valued  function  f on  some 
subset  D of  R has  at  least  m strong  sign  changes  if  f 
alternates  (in  sign)  on  some  (t^)™  if 

f (Tq)  ^ 0 and,  in  case  m > 0,  f (t^  ^)  f (t  ) <0  for  i=l,  . . .,m, 

for  some  nondecreasing  sequence  (x  )™  in  D.  It  is  customary 
to  denote  by 

s'(f) 

the  total  number  of  strong  sign  changes  of  f on  its  domain. 

It  is  well  known  (e.g..  Theorem  5.1.4  of  [47])  that,  for  a to- 
tally positive  matrix  A and  any  vector  CC, 

S‘(A a)  < S~  (CO, 

i.e.,  a totally  positive  matrix  transformation  is  variation  di- 
minishing. Since  (N^(t\))  is  totally  positive,  it  follows 

that  the  linear  map  V^,  given  for  some  nondecreasing  t by 
n 

(7.4)  V f :=  Z f (t  , ) N . , all  f, 

t j=1  J J 

is  variation  diminishing,  i.e.,  S (V^f)  < S (f) . Recall  now 
from  Marsden's  identity  (see  (5.6)  and  (5.7))  that,  for  any 
straight  line  p and  any  x with  e (t^, t^+^),  all  i, 

n f r k-1 

p = Z Jp(t .)  + p'(t  ) £ ti,r  • (k-DT, 

j=l  I J J U r=l  J r 

on  [t^, t^+^].  Therefore,  with  the  particular  choice 

(7.5)  T*  : = +...+  t ^) / (K“ 1) , J = 

mentioned  already  in  (5.15),  V ^ reproduces  TP^  on  [t^,tn+^], 
and  we  have 


/ (k-l)>  N 
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(7.6)  s'(VT*f-p)  < s'(f-p)  on  [tk,tn+1],  all  p e T 2>  all  f. 

The  resulting  approximation  V ^f  to  f is  Schoenberg's  varia- 
tion diminishing  spline  approximation,  introduced  by  Schoenberg 
in  [73]  and  further  discussed  in  Marsden  and  Schoenberg  [63], 

We  note  th*  following  result  due  to  Marsden  [62]:  Write 

V to  stress  dependence  on  k,  and  restrict  t so  that 


t . = ...  = t,  = 0 and  t . 
i k n+1 


= t , = 1.  Then 

n+k 


(7.7)  V_,  , -*  1 pointwise  on  C[0,1]  iff  max.  At./k  -» 0 , 

k I I 

as  Marsden  shows  with  the  aid  of  the  Bohman-Korovkin  theorem 
concerning  strong  convergence  of  positive  operators  to  the  iden- 
tity on  C [0, 1]  . 

It  is  possible  to  refine  the  proof  that  S (ACi)  < S '(a) 
for  a totally  positive  matrix  A for  the  particular  choice 
A = (Nj(tj))  30  as  to  obtain  the  following  theorem. 

n 

THEOREM  7.6  [15],  _If  f :=  Z QI.N  alternates  on  (t )™,  then 

j=i  J J 1 U 

f(Ti)0!q  Nq  (V  >0’  1 = 

for  some  subsequence  q oj  (1, . ..,n). 

Theorem  7.6  illustrates  the  point  made  earlier  that  B- 
spline  coefficients  "model"  the  function  they  represent.  A 
spline  cannot  change  sign  at  a point  without  its  B-spline  se- 
quence also  changing  sign  "nearby." 

t 

As  a specific  application  of  this  theorem,  consider  the 


spline  N 


which,  by  (5.2),  is  the  linear  combination  of 


j+1  B-splines  (of  order  k-j),  hence  cannot  have  more  than  j 
strong  sign  changes,  by  Theorem  7.6.  On  the  other  hand,  if 
^ is  continuous,  hence  absolutely  continuous,  then 
is  orthogonal  to  P . on  [t  ,t  .],  by  (4.15),  therefore  must 

1 l'K 

have  at  least  j strong  sign  changes. 
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COROLLARY  [30],  B-splines  are  bell-shaped.  Precisely,  if  Np  ® 

/ • \ 1 

is  continuous  for  some  j < k,  then  has  exactly  j zeros 

in  (t^,ti+^),  all  simple,  i.e.,  there  exists  (|  )q+* 

r ( i) 

t.  = |„  < ...  < = t.  so  that  (-)  N;J  >0  on  (£  , £ 

l 0 j+1  i+k  l — ' r r+1 

r = 0,  . . ., j. 

Finally,  we  record  the  relationship  between  B-splines  and 
Polya  frequency  functions  discovered  by  Curry  and  Schoenberg 
[30],  By  definition,  a Polya  frequency  distribution  is  any 
distribution  function  F (i.e.,  any  function  of  the  form 

F(x)  = / f(s)ds  with  f nonnegative  and  F(a>)  = 1)  whose  bi- 

- 00 

lateral  Laplace  transform  is  of  the  form 

00 

/ e”  SXdF  (x)  = 1/tKs) 

-00 

with 

2-00  - 

. -ys  +os  "tt  ..  - .-os 

i)f(s)  = e ||  (l+&^s)e  v 

5s 

for  some  y > 0,  5 real,  and  (5  ) e If  t(s)  = e , then 

^ 5S 

dF  has  its  entire  unit  mass  located  at  x = 8.  If  \J/ (s)  4 e , 

then 

00 

/ e SXA(x)  dx  = l/\|/(s) 

- 00 

with  A a Polya  frequency  function,  i.e.,  a nonnegative  inte- 
grable  function  on  1R  (normalized  to  have  / A = 1)  for  which 
the  kernel 

K(x,y)  :=  A(x-y) 

is  totally  positive  of  all  orders. 

Call  F.  a spline  distribution  function  of  order  k if 
k 

F^  has  a B-spline  of  order  k as  its  density,  i.e.,  if 
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VX)  = k lco[T0---Tk](*-S) 


k-1 


ds 


for  some  x_  < . . . < x,  with  x.  < x . Note  that  F,  (x)  = 0 for 
x < XQ  and  F^(x)  = 1 for  x > x^,  by  (A. 2).  Further,  say  that 
F^  converges  to  a distribution  function  F in  case  lim^^F^(x)  = 
F(x)  for  all  points  x at  which  F is  continuous. 

THEOREM  7.6  [30].  The  distribution  function  F is  a Polya  fre- 
quency distribution  iff  F is  the  limit  of  a sequence  (F^)  of 
spline  distributions,  with  F of  order  k,  all  k. 

K. 

8 . "Best"  Interpolation 


In  this  section,  I finally  discuss  an  aspect  of  splines 
which  many  consider  to  be  the  primary  characteristic  of  splines, 
viz.  the  fact  that  splines  are  solutions  to  interesting  variation- 
al problems.  This  property  of  splines  is  closely  related  to  the 

fact  that  the  B-spline  M.  represents  a k-th  order  divided  dif- 

i , k 

ference.  As  mentioned  already  in  (4.1),  if  a < t,  < ti->k— then 


(8.1) 


•’Wf 


b 

= / M, 

a 


i.k 


(s) 


f (k) (ds) /k! 


for  every  f e]M^[a,b]  :=  {feCk  ^[a,b]|f^k  ^ abs.cont.,  f*~k  ^eBV). 

Details  for  the  material  in  this  section  can  be  found  in 
[14]  and  its  references. 


Consider  the  problem  of  minimizing  (| f 


00 


over 


(8.2)  F :=  F (x,a,k,  [a,  b])  :=  [f  e lLk[a,b]  ; ft  = a} 

P P P 

for  given  x :=  (x^)  in  [a,b],  nondecreasing  with  x^  < t^+^, 
all  i,  and  given  (X  e ]Rn,  with  [a,b]  finite,  positive  k < n and 
p e [l,oo].  Here,  f!  is  the  sequence  (f^)^  given  by  the  rule 


f :=  f^  (xi),  with  j :=  max  { r | Xj_r  = x^. 

Fp  is  not  empty.  It  contains,  e.g.,  exactly  one  polynomial  of 
degree  < n.  Therefore, 
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Fp  = {f  e Va’bl  |f  T = fa  T} 


for  some  fixed  f e F . Favard  [35]  already  knew  and  used  the 

u p 

fact  that 


inf  ||f 
feF 

P 


(k) 


with 


Ip  = inf  Hgllp 

geG 

P 


(8.2')  cp  :=  tf(k)  |feFp)  = Cgel.p  l/M1>kg  = k!  [t±,  . . 

i = 1,  . . .,n-k). 


Let  now  1 < p < oo  and  1/p  + 1/q  = 1.  Then,  following 
Krein  [54],  we  recognize  that  minimization  of  ||g^||  over 
can  be  viewed,  dually,  as  the  construction  of  an  extension 
A e 1L  = IL*  of  minimal  norm  to  all  of  1L  [a,b]  of  the  linear 

p q q ’ 

n™  Ic 

functional  A , given  on  & = span(M  ) IL  [a,  b]  by 

OC  iCjT  I,  k 1 C| 

V -*R:  ¥i#i,kH¥lk:|li 

This  is  so  since  G , as  a subset  of  IL*,  coincides  with  the 

p q 

set  of  all  extensions  of  A . Therefore 

a 


(8.3)  inf  ||  f 
feF 

P 


(k) 


lp  = .ml Ml*  el*,  A|  = Aa)  = ||*all  , 

X,  T 


by  the  Hahn-Banach  theorem,  settling  existence  of  a minimal  f 

in  F as  well.  Further,  a minimal  f must  agree  with  f 

P & a 

at  Ti,***,Tk  while  its  k-th  derivative  satisfies 
(8.4)  / f(k)  (s)*(s)  ds  = ||f(k)  |Llk|| 


for  any  IL  - extremal  \|r  of  A^,  i.e.,  for  any  i|r  with 


(8.5)  e A 

k,  t 


and 


= 1 and  A = ||A  ||. 
q cr  11  a" 
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If  A 0,  then  there  is  a polynomial  of  order  k in  F 
a ' p 

and  it  is  the  unique  minimizer  for  all  p.  Otherwise  A / 0. 

a 

But  then,  for  1 < q < oo,  A has  exactly  one  extremal  and  the 

^ /i  \ 

equality  (8.4)  in  Holder's  inequality  then  forces  r to  sa- 

tisfy 

(8.6)  f^k)  ||Aal|U|q  1 signum  \J/. 


It  follows  that 


(k) 


is  uniquely  minimized  on  F , and  the 
P „ P 

minimizer  is  the  unique  element  f^  of  the  nonlinear  family 


(8.7)  (f  e !L*[a,b]  | f^  = |\)i  |q  isignum  \|/  for  some  i(f  e ^ ^}, 


q-i 


> 1 

Such  functions  have  been  called  1L  - 

P 


for  which  ? I f I . 

p 'x  CC I T 

splines  by  Golomb  [42]  who  was  apparently  the  first  to  describe 
their  structure. 

For  p 2,  the  family  (8.7)  is  linear  and  consists  of  all 
. k . , <-(k)  , 

f e ILj  with  f e «8^  To  describe  the  corresponding  mini- 
mizer, let  t be  the  extension  of  x to  a nondecreasing  se- 
quence having  both  a and  b occurring  exactly  2k  times.  Then 

A 

the  minimizer  in  F is  the  unique  f in  £ which,  in  ad- 

4 2 Zk , t 


dition  to  the  condition  f 


2 T 


U X 


, also  satisfies 


(8.8)  (x.-a)f^2k"i)(a')  = (b-x^^)  f <2k_i)  (b')  = 0,  i = l,...,k. 


The  minimizer  has  been  called  by  Schoenberg  [73]  the  natural 

spline  interpolant^  of  order  2k  with  interior  knots  ri>'"’’Tn 

for  f in  case  a < x,  and  t < b,  in  which  case  all  the 
a In’ 

A 

constraints  (8.8)  on  f^  are  active.  At  the  other  extreme, 

when  none  of  the  constraints  (8.8)  on  f^  is  active,  i.e., 

when  a x,  ...  x,  and  x , , ...  x = b,  the  mini- 

1 k n-kil  n 

mizer  has  been  called  by  Schoenberg  (see,  e.g..  Lecture  7 of 

[76])  the  complete  spline  interpolant  for  f of  order  2k 

with  interior  knots  x.  x . . The  word  "spline"  itself 

k i 1*  * n-k  1 
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was  chosen  by  Schoenberg  [70]  because  in  Che  case  k = 2 the 
resulting  interpolating  cubic  spline  approximates  (for  small 
slopes)  the  position  of  a mechanical  or  draftman's  spline  forced 
to  go  through  the  given  data  points.  This  connection  between 
(2k-l)st  degree  spline  interpolation  at  knots  and  least- squares 
approximation  to  the  k-th  derivative  has  remained  for  many  the 
major  reason  for  using  splines. 

For  p «>,  (8.4)  fails  to  pin  down  the  minimizer  uniquely 
since  it  only  implies  that 


(8.9) 


: (k) 


= ||Aa!|signum  \]<  off  N :=  {x  e [a,b]  | j>(x)  = 0} 


for  every  JL^-extremal 


of 


Of  course,  if 
a ’ 


has  mea- 


sure zero,  then  it  follows  that  the  minimizer  f is  unique  and 
its  k-th  derivative  is  absolutely  constant,  with  < n-k  break 
points,  by  Theorem  7.5,  since  ij/  is  a nontrivial  linear  com- 
bination of  n-k  B-splines.  In  the  language  of  Glaeser  [40,41], 

f is  a perfect  spline  of  degree  k,  i.e.,  a pp  function  of 
k- 1 

with  absolutely  constant  k-th  derivative. 


order  k+1  in  C 

Whether  or  not 


N, 


has  zero  measure,  supp  i]/  = [a,b]\N 


V 


must  contain  the  support  of  some  B- spline  of  order  k for  the 
knot  sequence  j,  by  Lemma  5.1,  i.e.,  some  interval  (t  ,T^  ) 

on  which  then,  by  (8.9),  ail  minimizers  must  agree.  This  is 
the  "core  interval  of  uniqueness"  of  Fisher  and  Jerome  [36]. 

In  particular,  the  minimizer  is  uniquely  determined  in  case 
n = k+1.  It  is  also  uniquely  determined  in  case  n = 2k  and 


a = t1 


= T, 


k+1 


T2k  = b’ 


as  was 


found  by  Glaeser  [40,41],  since  now  t ^ bj. 


For  the  specific  data  f^(x) 


/ (s-a)^  ^(b-s)k  ^ds,  Louboutin 

u 


[58]  (see  also  Schoenberg  [75,76])  found  f explicitly  in  this 
(k) 

is  evidently  orthogonal  to  ^ ^ T on  [a , b J , 

must  be  a step  function  with  < k jumps  and 


case : 


a 


, . *(k) 

therefore  f 
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orthogonal  to  P,  . on  [a,bj.  But,  since  F . is  a Chebyshev 

K “ 1.  /,  v K*  i. 

A (k) 

system,  this  pins  down  signuxn  f uniquely  up  to  multiplication 
by  a sign  ere  {-1,1}, 


*(k)  . „(1) 

signum  f = <T  signum 

with  C.  (x)  = (*)k  ^G,  (2^—^  - 1)  and  C the  Chebyshev  polyno- 
k KD“cl  /iv 

mial  of  degree  k.  It  follows  that  f'  ' is  a B-spline  of 
order  k with  simple  knots  at  the  k^l  extrema  of  on 

f a , b J (see  (4.15)).  But,  in  general,  there  will  be  several 

distinct  minimizers.  Karlin  [48]  was  the  first  to  see  that 

* 

among  these  has  to  be  at  least  one  perfect  spline  f of  degree 
k with  < n-k  interior  knots.  Its  derivative  can  be 

constructed  [10]  as  a limit  point  of  the  net  (g£) £ , with 

g the  unique  minimizer  of  in 


G 


b 

(g  e ija,  b)  | /a  0g 


all  0 e S£ } 


where 

00 

Se  Ke(*\  TMKe*>(x)  :=  / exp(-(y-x)2/(2e2)0(y)dy/(e/2^). 

’ - -CO 


The  minimizer  g£  is  in  fact  uniquely  determined,  absolutely 

constant  and  has  < n-k  jumps,  since  the  total  positivity  of 

(N.  ( cr. ) ) for  increasing  a (see  Theorem  7.5)  implies  [47] 
j , k i 

that  (K£N.  k(o"t))  Is  strictly  totally  positive  for  strictly 
increasing  cr;  therefore  any  nonzero  element  y of  SE  van- 
ishes on  < n-k  points.  Final ly,  Favard  [35]  constructed  a min- 

A 

imizer  f which  is  a spline  of  degree  k with  < n-k  interior 

knots,  all  simple,  with  the  additional  property  that,  for  any 
(k)  * (k) 

feF^,  if  | < i f ' | implies  that  f f.  This  minimality 

of  "Favard1 s solution"  is  further  underlined  by  the  fact  that 

k ^ 

it  is,  for  any  r e [l,<w),  the  ]Lr-limit  of  f as  p -*  co  [25]. 

For  p 1,  matters  are  least  satisfactory  since  IL  now 

P 

fails  to  be  the  dual  for  IL  . Therefore,  although  (8.3)  still 
holds  for  this  case,  it  may  happen  that  none  of  the  norm 
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preserving  extensions  of  ? to  ail  of  ]L^  is  representable  as 
integration  against  an  ]L^- function,  in  which  case  the  infimum 
over  is  not  attained.  In  this  situation,  one  may  be  satis- 

fied to  follow  the  lead  of  Fisher  and  Jerome  [37]  and  consider 
the  slightly  different  problem  of  minimizing 

f(k>||  :=  Var  f<k"l> 

over 

Fl  :=  (f  c WR[a,b]  j f |T  = fa|TJ 

instead,  which  always  has  solutions.  If  ^ all  i, 

then  among  these  solutions  is  a spline  of  order  k with  < n-k 
interior  knots,  all  simple. 

We  close  this  section  with  yet  another  B-spline  property, 
this  one  connected  with  perfect  splines,  optimal  recovery 
(alias  best  class  estimators)  and  -approximation  by  splines. 

LEMMA  8.1  (Micchelli  (64j).  If  r = (t  )”  is  nondecreasing  in 
(a,b)  with  n > k,  then  there  exists  (up  to  multiplication  by 
some  a e {-1,1})  exactly  one  sign  function  h with  < n-k 
jumps  which  is  orthogonal  to  «^_  on  [a, b ] . If  3 *0 

...  < 5r+l  b,  and,  for  this  h, ' (-) ih  = 1 sn  (lt,l1+1), 

i 0,  . . . , r,  then  r n-k  and  e ^),  i = l,...,r. 

Micchelli 's  lemma  is  not  entirely  unrelated  to  the  follow- 
ing fact  about  B-splines  useful,  e.g.,  in  the  characterization 
of  best  L^-approximations  by  splines. 

LEMMA  8.2.  _If  t (t  )”  ^ is  nondecreasing,  in  [a,b],  with 
Ci  < ti+k’  and  f e lL^[a,b]  is  orthogonal  to  ^ on 

[a,bj,  then  there  exists  £ = (t^)1^  * strictly  increasing  in 

[a,bj  with  t^  < ^ j (any  equality  holding  iff  t = 

t . .),  t = l,...,n+l,  so  that  f is  also  orthogonal  to  <6,  .. 
l+vc-l  1,| 

Indeed,  since,  for  appropriately  chosen  p e P^,  the 
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function  F :=  p 


/ (--y)k  *f (y) dy/ (k- 1)  ! vanishes  at 


a ' " + 

(counting  multiplicities)  by  assumption,  and 


F is  in  C 


t 

k-1 


[a,b], 


Rolle’s  Theorem  proves  the  existence  of  strictly  increasing 

n ^ in  [a.b]  with  t.  < |,  < t.  . ,,  all  i,  at  which 
* i — i - i+k-1'  ’ 

^ = const  + / (--y)^f(y)dy  vanishes,  which  proves  the 

dL 


lemma.  In  particular,  if  f is  continuous,  then  it  must  vanish 
at  the  n points  of  some  strictly  increasing  sequence  (♦,  ) " with 

i. 


t.  < ti  < t.  . , all 
i i+k’ 


9 . Generalizations 

The  trend  started  by  Schoenberg  [71]  and  Greville  [43]  to- 
ward ever  more  generalized  splines  continues  unabated  but  has 
failed  to  bring  with  it  a corresponding  wealth  of  generalized 
B-splines.  Schoenberg  [71]  actually  described  trigonometric 
B-splines  and  later,  Burchard  [21]  and  Karlin  [47]  independently 
constructed  Chebyshevian  B-splines  with  the  aid  of  Popoviciu's 
[67]  generalization  of  the  divided  difference  notion.  Yet  an- 
other account  can  be  found  in  Marsden's  thesis,  eventually  pub- 
lished in  [61],  in  which  the  generalization  of  Schoenberg's 
variation  diminishing  spline  approximation  for  Chebyshev  splines 
is  given,  but  without  a proof  of  its  variation  diminishing  char- 
acter. ouch  a scheme  had  already  been  described  and  proven  to 
be  variation  diminishing  by  Karlin  and  Karon  [49],  and  their 
assertion  in  [50]  that  Marsden's  B-splines  are  essentially  dif- 
ferent from  Karlin's  is  incorrect. 

Here  are  some  of  the  details  of  the  construction. 

Let  Pf  be  the  polynomial  of  degree  < k which  agrees  with 


f at  the  distinct  points 
then 


1' 


'V 


If  0.(x)  = x 


J-l 


all  j, 


(9.1)  f - Pf 


-I;;:::::;;:, 
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Therefore,  since  [t  , ...,x  x]f  is  the  leading  coefficient  in 

1 K 

the  polynomial  of  degree  <k  which  agrees  with  f at  x^,..., 

x,  , x,  we  have 
k ’ 

(9.2)  f - Pf  Tk,-Jf)  ^..i-^k+i) 

with 


‘V 


■’ V,f  - (£-PE)/<«k+1-P«kn> 


det 


det 


•1' 


'V  *\ 


k+1 


k+1 


If  now,  more  generally,  (0  .) ^ is  a Chebyshev  system  (on  some 

( ■ • • • ) Tk+l\ 

interval  I),  then  det  I A a 1^0  for  distinct  t 

in  I and  the  following  definition  makes  sense:  The 

k-th  divided  difference  of  f at  the  distinct  points  x^,..., 


Vi  is 


k+1 


I with  respect  to  the  sequence  0 :=  (0  j ) i [67] 

(9.3)  tV-'ViV  :=  dGt  (01L,...,0^,f) 

Then,  with  Pf  denoting,  more  generally,  the  unique  element  in 

span(0.).  which  agrees  with  f at  x ,x.  ..we  have 

j 1 V ’ k+1* 

f-  Pf  «TI,...,tk,.|0£)(<M  - P«V+1) 

which  is  the  formal  analog  of  (9.2).  The  definition  shows  the 
generalized  divided  difference  (9.3)  to  be  a symmetric  function 
of  the  x^'s.  The  definition  even  allows  for  some  confluence 
among  the  ' s provided  the  0^ ' s are  sufficiently  smooth 
and  one  defines  (for  nondecreasing  t) 


det 


rci'***»Tk+i\  / *\c+i\ 

L / : det  L,1  / de t ( m. 0 . ) 

V01,...,0k4i/  '^1' * * ‘^k+r  i 1 


with 


p^f  :=  f^(x^)  and  j :=  max  {r  | x^  f = x^},  in  the  manner 
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of  Theorem  7.3.  More  detail  about  these  generalized  divided 
differences  are  provided  by  Popoviciu  [67],  and  see  also  Muhl- 
bach  [65 ] . 

Assume  that,  in  addition,  (0^) ^ spans  the  kernel  of  a 
k-th  order  linear  ordinary  differential  operator 

(9.4)  L*  : Dk  + Z a.Dj 

j < k J 

with  a.  e C^I),  all  j,  so  that  the  formal  adjoint 

(9.5)  L : (-)kDk  + Z (-)jDj(a.-)  = (-)k(Dk  + Z b.Dj) 


j<k 


j<k 


is  an  operator  of  the  same  kind.  Green's  function  G(x,y)  for 

■jjf  ( i ) 

the  initial  value  problem  L f = g, f (a)  = 0,  j = 0,...,k-l, 
can  then  be  constructed  as 


n • 

(9.6a)  G(x,  y)  (x-y)  z . 0,(x)\|r  (y) 

j 1 J J 


k k 

with  ('['j)  l the  basis  for  ker  L adjunct  to  (0^)  i.e., 

(9.6b)  Z 0^  ^ (x)  V . (x)  = 8 , i = 1,  . . . ,k,  x e I. 
j 1 j J ik" 

rj  4 k 

With  t (t.),  nondecreasing  and  t.  < t.  , , all  i,  the 
= i 1 i i+k' 

function 

(9.7)  M^L(y)  : [t.,  . . . , t.+k  ]0G(  • , y) 

is  then  piecewise  in  ker  L with  breakpoints  t.,...,t.  , and 

k-2  1 

in  C in  case  t.  < ...  < t.  , . In  the  language  of  Gre- 

l l+k 

ville  [43],  M is  a generalized  spline  function  with  respect 
i , L 

to  ker  L.  Coincidences  among  the  t^ 1 s reduce  the  smoothness 

of  M.  across  t.  in  the  usual  way.  Further, 

1,  L j 


(9.8)  M.  vanishes  off  (t  , t ) 

L y Li  " X 1 • K 
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since,  for  y > t1+R,  G(-,y)  I (t  ^ t ) =0  while,  for  y < t±, 

i.  1 ■^‘k 

G ( • , y)  1 . e ker  L*  by  (9.6).  One  also  has  the  analog 

KWk; 


(9.9) 


[tt,  . . .,  t1+kJ^f  - / 


■i+k 


Mi  L(y)L  f(y)  dy* 


k+1 


If,  in  addition,  (0^) ^ is  an  extended  complete  Cheby- 
shev  (or,  ECT)  system,  then  Burchard  [21]  and  Karlin  [47]  have 
shown  the  analog  of  the  Schoenberg-Whitney  Theorem  7.1  that, 
for  strictly  increasing  t and  strictly  increasing  x = (x  )^, 
det(M.  T(x,))  >0  with  strict  inequality  iff  r (xj  4 0, 

J y Li  J-  1;  W 1 


til  i.  Further,  Karlin  [47]  showed  that  (M . (x  ))  _i 

j,  L 1 


s to- 


tally  positive  in  this  case,  as  was  mentioned  earlier.  Few 
facts  beyong  these  are  known  for  Chebyshev  B-splines.  While 
the  analog  of  Marsden's  identity  (5.7)  can  be  found  in  [61], 
the  analog  of  the  linear  functional  (5.4)  has  not  been  described, 
although  that  should  be  fairly  easy.  More  importantly  for  com- 
putations, a recurrence  relation  like  (4.9)  has  been  searched 
for  in  vain  so  far. 

It  is  actually  quite  unnecessary  to  assume  that  (0^) ^ is 
a Chebyshev  system  in  order  to  construct  L-splines  (in  the 
sense  of  Greville)  of  local  support.  Continue  to  assume  that 
(0.)^  is  a basis  for  the  kernel  of  the  differential  operator 
L*  of  (9.4)  with  L of  (9.5)  its  adjoint  and  G the  Green's 
function  given  by  (9.6).  If  t = (t^)"  stri-ctly  increasing, 

then,  for  each  i,  the  span  of  ( [t . 1) ^ contains  a nontrivial 

if  i<  ^ ^ 

M -L  ker  L since  ker  L has  dimension  k.  But  then 


(9.10)  M (x)  : = p G ( * , x) 

M,  L 

defines  an  L-spline  with  knots  t 
Clearly,  M 


(V  ti+k) 


M,  L 


i'  “ 
represents 


, , t and  support  in 
M with  respect  to  the 


pairing  <f,g)  :=  /fL*g.  If  new 


(0^) ^ fails  to  be  a Chebyshev 


38 


LINEAR  COMBINATIONS  OF  B SPLINES 


system  on  ft  . t,  , ].  then  there  exists  a nontrivial  p in 
i i+k 


the 


i^r 

span  of  (ft  ])^  and  orthogonal  to  ker  L*  for  some  r < k, 

i.e.,  the  corresponding  M has  even  smaller  support.  More 

P,  L 

explicitly,  let  (p.^  be  the  sequence 

ft  ]Dk_1, [t  ]D, ft  ],  ft  ft  ],[t  ]D, , [t  ]Dk_1 

1 1 1 2 n n n 

of  linear  functionals  and,  for  each  i,  let  V.  be  the  linear 

7 1 

1 ■♦■r 

functional  of  the  form  = p^  + ^(3^  p^  which  is  orthogonal 

to  ker  L*,  with  r as  small  as  possible.  The  corresponding 

sequence  (M  ) ^ k ^ of  basic  L-splines  is  then  a basis  for 
L 

the  space  of  all  L-splines  on  ft  , t ] with  simple  interior 

1 n 

knots  t„, ...,t  .. 

2 n- 1 

A construction  like  this  was  used  by  Jerome  [45]  under  the 

additional  assumption  that,  for  each  i,  t.  ,-t,  is  small 
r k ’ ’ i+k  t 

enough  so  that  (0^) ^ is  a Chebyshev  system  on  . 


Earlier,  Jerome  and  Schumaker  [46]  had  used  such  considerations 

in  connection  with  Lg-splines,  i.e.,  when  the  linear  functionals 

i “ 1 

(p^)  above  are,  more  generally,  of  the  form  p^  = ^ [t^  ]DJ 

Related  developments  of  great  generality  can  be  found  in  Brown 

[20]. 

We  close  this  section  with  yet  another  B-spline  property 
discovered  by  Curry  and  Schoenberg  [30]. 

LEMMA  9.1  [30],  Let  Mn  , be  the  B-spline  defined  by  (3.1), 
and  let  <J  be  any  k-simplex  in  ]R  of  unit  volume  with  ver- 
tices i = 0,...,  k and  so  that  v^  = l^,  i = 0,...,k. 

Then,  for  all  x, 

M0  k(x^  = la  n (v  6 ^ : VL  = x)  |, 

i.e, , M (x)  gives  the  (k- 1) -dimensional  volume  of  the  in- 

0,k  k 

tersectlon  of  the  simplex  a with  the  hyperplane  in  ]R  which 

intersects  the  v^-axis  at  v^  = x and  is  orthogonal  to  it. 
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In  a letter  f 7 2 J to  P.  Davis,  Schoenberg  recalls  the 
Hermite-Genocchi  formula 

tc  oo 


(9.U)  ,«kJf  /•_••/£  <vozo™iziT'"’Vk;dvr"<lvk 

and  where  the  integration  is  to 


n 

v. 


with  w0  1 - -k 

be  carried  out  over  the  complex 


VL  > °> 


, v,  > 0 . L v . < 1, 

k ~ ' j i ~ 


and  points  out  that  Lemma  9.1  follows  from  this  on  comparison 

with  (4.1).  Schoenberg  further  recalls  that  the  Hermite-Genocdu 

formula  remains  valid  if  z , ...,z  are  points  in  the  complex 

U K 

plane  not  all  on  one  line  and  if  f is  a comp  lex- valued  func- 
tion regular  in  the  convex  hull  ""[7  of  z , ...,z^.  The  formu- 
la (4.1)  now  becomes 

(9.12)  [z 


Q,...,zk]f  M(x,y;zQ, . . ,,zk) f rk;  (x,y)  dxdy/kl 


At  the  point  z (x,y),  M(x,  Y>zq>  • • • > z^)  is  therefore  the 
(k- 2) -dimensional  volixne  of  the  intersection  of  the  plane 


, v.z  y)  with  a simplex  of  unit  volume  whose 


(v  £ IR  : x 

i-th  vertex  v^  satisfies  (v^,v^h  z j.  In  particular, 

M is  positive  on  j i and  zero  off  J 1 and  is  a spline  of 
order  k-1  along  any  straight  line,  with  knots  only  at  the  points 
where  such  a line  intersects  a segment  [z  ,z.J.  Schoenberg's 
letter  even  contains  a drawing  of  such  a B-spline  in  two  vari- 
ables for  k - 4. 

This  suggests  the  following  definition. 


s +k 


DEFINITION.  Let  a be  a nontrivial  simplex  in  ]R 
define  the  B-spline  of  order  k f rom  a 

S -flc 

Mk,c/Xl,*“,X s')  lCTn(V£l<  ! vi=xi' 


On  K , 
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all  x e ]RS  . 

Then  M,  _ is  unimodal,  nonnegative,  piecewise  polynomial  of 

a 

total  order  k,  and  in  C in  general.  Its  support  is  the 

5 

projection  of  a onto  ]R  , i.e.,  the  convex  hull  of  the  pro- 

(i ) s k s 

jections  ((v.  )••,).„  of  the  vertices  of  a to  ]R  . 

j j 1 1=0 

At  this  point,  I have  no  idea  how  useful  these  B-splines 
might  be,  even  only  for  the  writing  of  papers.  It  is  easy  to 
visualize  how  such  B-splines  can  be  made  to  give  a partition  of 
unity:  One  takes  some  suitable  convex  set  C in  IR  of  unit 

S S "hk 

volume  and  then  subdivides  the  cylinder  IR  x C in  ]R  into 
nontrivial  simplices.  The  corresponding  B-splines  will  then 
add  up  to  one.  But  it  is  unlikely  that  these  B-splines  will 
become  very  useful  unless  one  finds  some  means  of  evaluating 
them  such  as  a recurrence  relation  like  (4.9). 

In  any  event,  I think  these  B-splines  are  very  beautiful. 
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It  seems  a shame  to  waste  this  almost  empty  page,  so  here 
is  yet  another  reference  to  work  concerning  B- splines:  In 

Phillips,  J.  L.,  and  R.  J.  Hanson,  Computing  integrals  involv- 
ing B- splines  by  means  of  specialized  Gaussian  quadrature 
rules,  TR  #CS-73-001,  Comp.  Set.,  Washington  State  U., 
Pullman,  WA.,  1973, 

one  finds  a discussion  of  the  procedure  for  generating  the 
three- term  recurrence  relation  for  the  polynomials  orthogonal 
with  respect  to  a B-spline  as  weight  function,  as  well  as  the 
abscissae  and  weights  for  the  corresponding  Gauss  quadrature 
rule.  For  orders  2 and  4,  and  for  a uniform  knot  sequence, 
specific  numbers  are  on  microfiche  in 

Phillips,  J.  L.,  and  R.  J.  Hanson,  Gauss  quadrature  rules  with 
B-spline  weight  functions,  Math.  Comp.  28  (1974),  666. 
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This  survey  of  nonlinear  approximation  theory  is  given 
with  emphasis  on  the  "local-global-principle".  It  starts  with 
the  functional  analytic  approach.  The  theory  of  Rice  and 
Meinardus/Schwedt  for  nonlinear  Chebyshev  approximation  are 
not  only  considered  with  respect  to  their  merits  for  the 
approximation  by  rational  functions  and  by  exponentials.  They 
made  the  way  for  Wulbert's  approach  to  global  analysis  conti- 
nued by  the  author.  For  the  treatment  of  the  exponentials 
these  modern  tools  are  necessary  though  the  theory  is  not  yet 
complete  for  n i 4.  The  difference  between  approximation  with 

respect  to  L -norms  and  the  sup-norm  becomes  obvious. 

P 

I Introduction 


This  lecture  is  anounc.ed  as  a survey  of  nonlinear 
approximation  theory.  In  the  last  years  there  were  so  many 
activities  in  so  many  directions  that  it  seems  to  be  impossib- 
le to  give  a fairly  complete  survey.  Therefore,  I have  to 
focus  my  attention  to  the  central  problems.  In  order  to  give 
you  an  impression  of  the  progress,  the  problems  are  discussed 
from  the  viewpoint  of  what  will  be  called  the  "local-global 
principle".  This  principle  turns  out  to  be  crucial  for  the 
total  development. 

Because  of  the  incompleteness  of  my  talk  and  the  list  of 
references  the  reader  is  referred  to  the  following  articles 
witii  survey  character  [ 1,4,13,27,351  and  bibliographies  |9, 
I4,34|  . 

In  nonlinear  approximation  theory  most  investigations 
are  based  on  the  concept  of  best  approximation:  Given  a non- 

Preceding  page  Hank 
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void  set  G in  a normed  linear  space  E and  an  element  f^G,  then 
j^G  with  minimal  distance  from  f is  called  a nearest  point  or 
best  approximation  (or  short:  a solution).  It  is  called  a 
local  best  approximation,  if  it  Is  a best  approximation  to  f in 
some  neighborhood  of  g in  G . 

A set  G^E  is  called  an  existence  set  (uniqueness  set, 
Chebyshev  set,  resp.)  if  for  each  f€E  there  is  least  one 
(at  most  one,  exactly  one,  resp.)  best  approximation.  For  an 

Q 

existence  set  G the  metric  projection  P=P  :E  — >2  is  the 

o 

mapping  which  sends  each  element  f to  the  set  of  its  best 

approximations.  If,  moreover,  G is  a Chebyshev  set,  then  P 

G 

may  also  be  understood  as  a mapping  onto  G instead  of  2 . 

There  are  only  a few  non-trivial  Chebyshev  sets.  It  is 
remarkable  that  one  of  them  was  found  already  very  early  by 
Chebyshev.  He  recognized  that  the  best  approximation  in  the 
set  of  rational  functions 

(1.1)  R^,  r“{g=p/q^Cl  o,  1)  ; p,q  polynomials,  3p«li,3q$r} 

is  always  unique,  if  C|o,l|  is  endowed  with  the  sup-norm.  The 
rational  functions  also  serve  as  a "picture-book  example"  for 
most  of  the  recent  theories.  This  family  behaves  almost  as  a 
linear  one.  There  is  only  one  complication,  the  metric  pro- 
jection is  not  continuous  at  f,  if  Pf  is  degenerate,  i.e.  if 

p£€Rv-.,r-r 

On  the  other  hand  the  best  rational  approximation  is 
not  always  unique,  if  the  function  space  is  endowed  with  an 
Lp-norm,  l<p^eO.  This  result  was  first  given  in  1961  by 
Efimov  and  Stechkin  | 2o|  as  a byproduct  of  the  geometric 
theory.  In  this  context  it  is  worth  noting  that  it  lasted 
9 years  til  an  example  for  nonuniqueness  was  published  in  the 
thesis  of  Laxnprecht  | 24 J . Hut  as  is  well  known  the  cited 
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result  is  not  the  only  one  in  that  theory. 

because  of  the  extraordinary  role  of  nonlinear  Chebyshev 
approximation,  here  the  analytic  approaches  of  Rice  [31]  and 
Meinardus  and  Schwedt  [ 28)  are  more  important.  Even  if 
uniqueness  does  not  hold,  it  is  often  possible  to  derive  a 
(finite)  bound  for  the  number  of  solutions  [ 7,8] . For  this 
aim  new  methods  must  be  developped  [ 51  . From  this  viewpoint 
Wolfe's  recent  result  on  the  non-existence  of  a bound  for 
rational  L^-approximat ion  [ 36)  is  of  great  interest. 

We  close  the  lecture  with  a remark  on  non-linear  inter- 
polation. 

2 Convexity  of  Chebyshev  sets 

The  first  steps  into  the  study  of  Chebyshev  sets  by 
geometrical  tools  were  done  by  Bunt  (12)  in  1934  and  by 
Motzkin  [ 3o)  in  1935.  It  lasted  15  years  til  a first  genera- 
lization of  Motzkin's  result  to  infinite  dimensional  spaces 
was  carried  out.  Until  the  middle  of  the  last  decennium 
Victor  Klee  [ 23) , Efimov  and  Stechkin,  Vlasov  and  later 
Brosowski  and  Deutsch  developped  the  theory.  Since  an 
excellent  survey  has  been  given  recently  by  Vlasov  [ 351  , we 
restrict  our  attention  to  the  main  results.  We  will  try  to 
understand  why  there  are  still  so  many  open  problems. 

DEFINITION  2.1.  A normed  linear  space  E is  called  strictly 

i 

convex,  i f 1 xH  =1  yl  * 1 , x^y  imply  II ^-(x+y) B < 1 . It  is  called 
uniformly  convex  if  given  e>0  there  is  a <5>0  such  that 
II  xll  =11  yl  = 1 , lx-yl>6  imply  II  ~;(x+y)  I ^ I -c . A normed  linear  space 
E is  smooth,  if  its  dual  space  E'  is  strictly  convex. 

Motzkin's  result  may  be  stated  as  follows: 

THEOREM  2.2.  ££  G is  a subset  of  a strictly  convex,  smooth 

space  of  finite  dimension,  then  the  following  are  equivalent: 
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I G is  a Chebyshev  set. 

2°  G is  closed  and  convex. 

The  theorem  applies  to  Euclidean  n-space.  It  is  still  an 
open  question  whether  it  may  be  extended  to  Hilbert  spaces 
without  lurther  hypothesis.  On  the  other  hand  it  is  in- 
teresting to  note  under  which  conditions  the  generalization 
is  possible. 

DEFINITION  2.3.  A sequence  {g  } in  a subset  GCE  is  called  a 
minimizing  sequence,  if  1 im  II  f-g  Jl  = inf  { II  f-gll  ;j^G} . 

G is  called  approximat ive 1 y compact,  if  each  minimizing 
sequence  in  G contains  a subsequence  which  converges  to  an 
element  in  G.  G is  called  bounded ly  compact,  if  the  inter- 
section  of  G with  each  closed  ball  is  compact. 

Here  convergence  is  understood  in  the  strong  topology. 
Analogous  definitions  referring  to  the  weak  topology  are  made 
in  the  same  manner  (c.f.|  12,35]). 

In  the  following  a simple  observation  will  be  crucial. 

LEMMA  2.4.  The  metric  projection  onto  an  approximat ivelv 
compac  t C he by she v set  is  a continuous  mapping. 

The  proof  of  a generalization  of  Theorem  2.2  splits  into 
two  parts.  The  concept  of  a sun  is  used. 

DEFINITION  2.5.  A sun  (strict  sun,  resp.)  is  an  existence 
set  G in  a norined  linear  space  E such  that  for  anv  f^E  and 
for  at  least  one  t£-Pf  (for  all  g£Pf,resp.)  we  have 
g€P|  g+A  (f-g)|  for  all  X'\. 


THEOREM  2.6.  Each  boundedly  compact  Chebyshev  set  in  a 


Banach  space  is  a strict  sun . 


Sketch  o 1 proof.  Let  f <?C.  Choose  a ball  B=Br(f,)  with  cen- 

1 O I 

ter  1 | and  radius  6 which  is  disjoint  from  G.  The  image  of 
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the  metric  projection  P(B)  is  bounded,  hence  relatively  com- 
pact. Now,  let  4>  be  the  mapping,  which  sends  V into  the  boun- 
dary of  B: 

♦<s>-V  6t?^- 

Consequently,  $•» P is  a continuous  mapping  of  B into  a relati- 
vely compact  subset.  By  Schauder's  theorem  there  is  a fixed 
point.  Uniqueness  of  the  best  approximation  implies  that  the 
fixed  point  is  Pfj+A(fj-Pf  ) with  A« 1 +6 (H f -P (f j ) I ) *.  it 
follows  from  an  open-closed  argument  that  the  assertion  holds 
for  all  A > I . □ 

LEMMA  2.7.  Each  sun  in  a smooth  normed  linear  space  is  con~ 
vex. 

Sketch  of  proof.  Let  C be  a sun.  Assume  that  g|,g^=C,  but 
for  some  aS(o,l)  we  have  f*ag  | ♦(  l-u)g^G.  Let  g€p(f)  be  a 
best  approximation,  to  which  the  definition  of  a sun  applies. 
It  follows  from  Theorem  3.1  that  for  i=l,2,  there  is  a hyper- 
plane containing  g which  separates  f from  g..  Smoothness 
implies  that  both  hyperplanes  coincide.  This  yields  a contra- 
diction, because  f lies  on  the  straight  line  connecting  gj 
and  g^.  0 

By  combining  Theorem  2.6  and  Lemma  2.7  one  gets  the 
fol lowing  theorem. 

THEOREM  2.8.  Each  bounded ly  compact  Chebyshev  set  in  a 
smooth  Banach  space  is  convex. 

There  have  been  several  attempts  to  weaken  the  compact- 
ness assumption  and  to  replace  it  at  least  by  the  continuity 
of  the  metric  projection,  (continuity  understood  as  weak  as 
possible)  | 17,2o,23,35| . But  it  seems  to  be  impossible  to 
abandon  it  completely. 
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Indeed,  recently  Dunham  (19]  has  discovered  an  example  of 
a Chebyshev  set  which  is  not  a sun. 

EXAMPLE  2.9.  Let  E=C[o,lj  be  endowed  with  the  uniform  norm 
and  G=‘F|a]  ; a^o}  where 

f(Dl)e't/a  if  a'O, 

F (a , t)  = < 

V.  0 if  a=0. 

Since  the  mapping  a * F(a)  is  continuous  with  respect  to 

compact  convergence  on  the  open  interval  (o , 1 ) , the  family  G 
is  compact  when  endowed  with  that  topology.  By  standard  argu- 
ments it  is  an  existence  set.  Uniqueness  follows  from  the 
monotonic ity : F (a,t )-F(b, t) >0,  whenever  a>b.  But  G is  not  a 
sun.  The  zero  function  is  optimal  for  i(t)=  but  not  for 
f(t)-  j,  x > i . 0 

We  conclude  this  section  with  a consideration  of  the 
continuity  of  the  metric  projection  from  the  global  point  of 
view. 

If  we  have  only  the  definition  of  continuity  in  mind, 
then  it  seems  to  be  a local  property.  Continuity  of  P onto  a 
Chebyshev  set,  however,  establishes  global  properties.  For 
example,  it  implies  connectedness:  given  the  mapping 

I o,  IJ  * X » P | ( 1 - / ) g j +X  g^l  furnishes  a connecting  arc. 

More  generally,  G must  be  contractible;  all  homology  groups 
must  be  trivial.  From  this  viewpoint  it  is  also  not  only  a 
coincidence  that  in  the  central  step  when  proving  Theorem  2.8 
a fixed  point  theorem  was  involved. 

I belive  that  the  gap  in  the  functional  analytic  approach 
results  from  the  fact  that  homology  is  a structure  which  is 
too  different  in  nature.  Note  that  the  family  in  Example  2.9 
is  not  connected  with  respect  to  the  strong  topology,  but 
it  is  connected  with  respect  to  the  weaker  topology,  in  which 
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the  parametrization  is  continous. 

We  note  that  these  difficulties  are  not  present  when 
manifolds  are  investigated.  So  manifolds  lead  to  an  alterna- 
tive approach  (s.  below). 

3 Suns 

The  concept  of  suns  has  turned  out  to  be  an  effective 
tool  in  nonlinear  approximation;  it  is  not  only  employed  in 
connection  with  the  problem  of  the  preceding  section.  It  is 
our  aim  to  characterize  suns  in  a geometrical  way  is.  Lemma 
3.3). 

THEOREM  3.1.  For  an  existence  set  G in  a normed  linear  space 
E the  following  are  equivalent: 

1 1 G is  a strict  sun. 

2°  For  any  f€E,  any  g €P„(f)  and  any  ^G,  the  element  g is 

O (j  ■ ———————  o 

also  a best  approximation  to  f in  the  convex  hull  of 
g _§nd  gQ. 

3J  Given  f^E  we  have  g eP„(f),  if  and  only  if  for  any  *€G 

O C/ 

there  is  a linear  functional  6 € E ' (which  may  be  chosen 
as  an  extreme  point  in  the  unit  ball  of  E ' ) satisfying 

HI!  8 =1 t 

«(£-*„)-> f-g0l. 

Re  fc  (g-gQ)^0. 

The  condition  specified  in  3°  is  a generalization  of 
Kolmogorov's  criterion.  The  proof  that  it  is  a necessary 
and  sufficient  condition  for  a solution  in  strict  suns  was 
simplified  by  inserting  condition  2°.  The  equivalence  of 
1°  and  2°  follows  from  a simple  geometrical  argument.  On 
the  other  hand,  the  equivalence  of  2°  and  3J  is  a consequence 
of  Singer'  characterization  of  best  approximations  in  convex 
sets  via  separating  hyperplanes. 
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We  emphasize  that  the  linear  functional  in  Kolmo- 
gorov's criterion  depends  on  g.  But  strict  convexity  of  E' 
implies,  that  the  functional  is  already  determined  by  f-g^. 
Then  the  difference  to  Singer's  characterization,  which  is 
restricted  to  convex  sets,  vanishes. 

In  the  framework  of  this  lecture  we  ask  for  characteri- 
zations of  suns  by  internal  (or  intrinsic)  properties.  To  be 
more  precise  the  specification  shall  not  refer  to  the  approxi- 
mation problem.  For  other  characterizations  the  reader  is 
refered  to  the  literature  1 1 , lo , 1 I , 35)  . The  answer  is  trivial, 
when  E is  a smooth  space  (c.f.  Lemma  2.7).  A characteri- 
zation of  the  desired  type  for  C(Q)  endowed  with  the  sup-norm 
was  given  by  Brosowski. 

THEOREM  3.2.  (Characterization  of  suns).  A non-void  set 
GCC(Q)  is  a sun  if  given  a pair  g,gQ-G  and  > '0,  then  for 
every  closed  set  A satisfying 

min  I g(t)-gfj(t)l  --Q 
t^A 

there  is  a g T=G  such  that  # g -g  11 " / and 

min  Re  { (g  ( t )-gQ (t ) ) (g (t )-gQ (t ) ) } '0. 
t€A 

The  above  characterization  shows  the  connection  between 
local  and  global  features:  g may  be  an  element  which  is 
arbitrarily  far  from  , but  it  has  an  influence  on  each 
neighborhood  of  g . 

It  is  probably  a question  of  personal  taste  as  to  whether 
the  generalizations  of  Theorem  3.2  [ lo,ll|  are  still  conside- 
red as  characterizations  by  "intrinsic"  properties. 
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In  the  finite  dimensional  case  there  is  an  intuitive 
geometrical  interpretation  for  the  characterization  of  suns 
[4]  . 

LEMMA  3.3.  Let  G be  a closed  set  in  C(Q),Q  being  a (finite) 
set  with  m points.  Then  the  following  are  equivalent: 

1°.  G is  a sun. 

2°.  Each  pair  of  elements  g , g jGG  may  be  connected  by  a 

continuous  curve  jg,  ,0$.AOl  in  G with  the  following  pro- 

A 

perty:  if  the  i-th  coordinates  of  gQ  and  g^  are  not  equal, 

i.e.,  if  (g,-g  ) .+0,  then  the  i~th  coordinate  (g, ) . is  a 
J O 1 A 1 

strictly  monotonic  function  of  A . 

Figure  1 illustrates  that  the  property  of  being  a sun  is 
comparable  to  convexity.  Connecting  curves  exist  in  both  con- 
texts, but  the  condition  on  the  curve  in  the  sun  situation  is 
less  restricting. 


connecting  line 
in  a convex  set 


Fig.  1.  Comparison  of  connecting  curves  in  suns  and  convex 
sets . 

4 Var isolvency 

In  the  preceding  sections  we  have  discussed  the  approach 
in  the  framework  of  functional  analysis.  In  196!  J.R.  Rice 
introduced  a completely  different  concept  for  nonlinear 
Chebyshev  approximation  [ 3 1 1 . (It  may  be  thought  of  as  the 
first  analytical  approach.)  Here  and  in  the  sequel  I denotes 
a compact  non-degenerate  real  interval . 


57 


DIETRICH  BRAESS 


DEFINITION  4.  1 . Let  GCC  (I ) . 

(i)  G is  said  to  be  locally  solvent  of  degree  m ajt  g , if 
given  t>0  and  m distinct  points  x^€I,  i*  1 , 2 , ...  ,m,  there 
is  a 6>0,  such  that  I gQ(x^)-y  J <<$ , i*l,2,...  ,m,  implies 
that  there  is  a g€G , satisfying  Hg~gQ II  < e and 

8^Xi)-yi=0, 

(ii)  G is  said  to  have  Property  Z of  degree  m at.  g^ , jjf  g-g^ 

has  at  most  m- 1 zeros  whenever  g€G,  gtg  . 

( i i i )  G satisfies  the  density  property,  if  given  g^SG  and 

e>0,  there  are  g,>g„€G  such  that  Hg.-g  N<e.  i=l,2,  and 
i i l o 

g,  (t)<gQ(t)<g2(t) , tei. 

A family  G with  the  density  property  is  said  to  be  varisol- 
vent , if  with  each  g€G  an  integer  m(g)  can  be  associated 
such  that  G is  locally  solvent  at  g and  satisfies  Property  Z 
both  with  degree  m(g) . 

Originally  varisolvency  was  introduced  without  postula- 
ting the  density  property  (iii).  In  1968  Dunham  discovered 
that  this  hypothesis  or  something  similar  must  be  added  to 
produce  an  elegant  theory  ( 18] . On  the  other  hand  the  density 
property  is  not  independent  from  the  other  postulates  ( c.  f . 

I 3,25]  ). 

THEOREM  4.2.  For  each  f€C (I ) there  is  at  most  one  best 
approximation  in  a varisolvent  family  G . Moreover . g€G  is  a 
best  approximation,  iff  f-g  alternates  at  least  m(g)  times. 

Here,  alternating  is  understood  as  in  the  classical 
theory  for  linear  Haar  subspaces. 

We  mention  this  theory  not  only  because  of  its  merits 
for  applications  (see  below)  but  because  of  its  clear  struc- 
ture. Observe  that  for  the  definition  of  varisolvency  (i) 
is  a local  property  whilst  (ii)  is  a global  one.  We  note 
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that  the  local  degree  cannot  be  larger  then  the  global  one. 
Rice's  theory  makes  clear  in  which  sense  the  connection 
between  local  and  global  behaviour  is  needed  in  the  analytic 
approach. 

When  Rice's  theory  is  applied  to  families  of  actual 
interest,  the  degrees  from  Definition  4.1  must  be  calculated. 
Here,  the  theory  of  Meinardus  and  Schwedt  1 28]  yields  an 
effective  tool  and  has  also  prepared  the  way  for  the  more 
modern  treatment. 

At  this  point  is  seems  appropriate  to  discuss  the  role 
played  by  the  parametrizat ions . In  his  original  paper  Rice 
formulated  his  results  for  families  which  are  given  in  a 
parametrized  form 

G*  {f (a , . ) ; a€A} . 

Here  A is  a (locally  compact)  parameter  set  and  F :AxI  >!R 

a continuous  mapping.  This  has  led  to  confusion  in  the  litera- 
ture. In  particular  one  has  to  distinguish  between  the 
topology  of  C(I)  and  the  topology  of  the  parameter  set. 
However,  Rice's  results  may  be  formulated  and  derived 
without  any  reference  to  parameters. 

On  the  other  hand  the  use  of  local  parameters  is  often 
advantageous.  It  may  even  be  considered  as  a typical  pheno- 
menon of  nonlinear  analysis  that  no  global  parametrization 
is  taken;  the  choice  of  parametrization  depends  on  the  ele- 
ment in  the  family,  which  (and  the  neighborhood  of  which)  is 
actually  being  considered.  A byproduct  of  the  theory  of 
Meinardus  and  Schwedt  | 28]  is  the  following. 


LLMMA  4.3.  Let  GCC(I) . 
open  set  AC TRm  into  G . 


Assume  that  F is  mapping  from  an 


differentiable  in 
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space"  d^F(ff\m)  i s a n m-dimensional  Haar  subspace  of  C(I) 
then  G is  locally  solvent  of  decree  in  at  g=F(a). 

The  lemma  is  easily  proved  by  the  use  of  the  classical 
imp  1 i c i t~f unc  t ion- theorem . 

Obviously  a family  G is  varisolvent,  if  for  each  $^G 
there  is  a local  parametrizat ion  as  given  in  the  lemma  whose 
"dimension"  m coincides  with  the  degree  m(g)  of  Property  Z 
at  g.  The  density  postulate  causes  no  trouble  here,  because 
the  tangent  space  is  a Haar  subspace  and  contains  a positive 
f unc t ion . 


EXAMPLE  A. A.  Let  R,,  , ^r>0  denote  the  set  of  rational  func- 

tions  given  in  (1.1).  The  defect  of  a function  p/q  is 


del 


r 

min  (K  - '-ip  , r-'<iq) 


if  p=0, 
if  pi  0 . 


Rj,  is  varisolvent  and  the  degree  at  p/q  is  m(g)=fc +r+l-def (g) 
+r+i  . 


EXAMPLE  A. 5.  Let  E^  , n^l  denote  the  set  of  proper  (sums  of) 
exponent  ia 1 s : 

n 


E„  - U-  I 


a e^ 

V 


1 ;a  , R e [R  J 

v v ‘ 


v=  i 

An  exponential  g has  the  order  k=k(g),  if  k is  the  least  num- 
ber such  that  g^E.  . It  is  shown  in  | 2|  that  E ° is  vari- 
R n 

solvent  and  that  the  degree  at  g is  n+k(g)^2n. 


The  metric  projection  (if  it  exists)  is  continuous  if  the 
image  point  is  normal  in  G. 


DEFINITION  A. 6.  An  element  gCG  is  a normal  point  in  a 

varisolvent  family  G if  one  of  the  following  equivalent  pro- 

pert  i es  holds: 
o 

1 The  degree  is  constant  in  a neighborhood  of  g. 

2 There  i s a neighborhood  of  g in  G,  the  closure  of  which 
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is  compact. 

3 ‘ A neighborhood  of  g i_rr  G is  a manifold  (i.e.  is  homeo- 
morphic  to  an  open  set  in  a Euclidean  space). 

There  is  no  convention  in  the  literature  as  to  which 
property  is  to  be  used  in  other  contexts  than  the  theory  of 
var i sol vency . To  complete  the  confusion  we  emphasize  that 
Condition  1°  is  not  equivalent  to  the  statement  that  the  de- 
gree is  maximal.  This  is  shown  by  an  example. 

EXAMPLE  4.7.  Let  C=C^JG2  where 

Gj=  {g(t)=a, 

G2=  {g=ueU,  a>0,  ef:R}. 

G is  a connected  set.  It  is  varisolvent  of  degree  m at  g,  if 
j€G^ , m=l,2.  Apart  from  g=0  all  elements  are  normal  in  G. 

5 Critical  point  theory.  Manifolds. 

The  theory  of  varisolvent  families  requires  a very  strong 
global  assumption  (Property  Z)  on  zeros.  This  is  due  to  the 
fact  that  the  families  need  not  be  manifolds.  Indeed,  the 
rational  functions  in  Example  4.4  and  the  exponentials  in 
Example  4.5  do  not  form  manifolds.  It  is  the  aim  of  this 
section  to  weaken  the  global  assumptions  as  much  as  possible. 
The  first  step  into  this  direction  was  done  in  1971  by  Wul- 
bert  I 37 , 3B| . 

To  illustrate  the  basic  concept  we  consider  a simple  exam- 
ple (c.f.  Figure  2).  Let  G be  a smooth  curve  in  2-space 
satisfying  the  following  local  property:  the  gradient  shall 
be  positive  (or  negative,  resp.)  at  each  point.  It  follows 
from  this  fact  already  that  there  is  a parametr izat ion  such 
that  each  coordinate  is  a strictly  monotonic  function.  The 
curve  is  a uniqueness  set  for  the  Chebyshev  approximation. 
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Fig.  2:  Curve  with  nonzero  gradient  in  JR'*  . 

The  generalization  to  manifolds  with  a dimension  greater 
than  one  is  by  no  means  trivial.  (Compare  with  the  fact  that 
Brouwer's  fixed  point  theorem  is  trivial  only  for  the  dimen- 
sion one,  because  in  that  case  it  reduces  to  the  intermediate 
value  theorem.) 


The  appropriate  framework  is  critical  point  theory  | 5] . 

DEF IN  IT ION  5.1.  Let  C be  a non-void  subset  in  a normed  line- 
ar space  R,  and  ^G.  Then  the  tangent  cone  C G at  g to  G con~ 

g 

aJ.stgof  all  elements  l^R  with  the  following  property:  there 
is  a continuous  mapping  of  ( a ,1  | : / — * g €G  such  that  gf  =g  and 
II  g -g-  Anil  = o(/ ) as  A — >o. 

A 1 

DEFINITION  5.2.  Let  GCR  be  non-void  and  f£R.  Then  ^C  i s 

called  a critical  point,  if  0 is  a best  approximation  to  f-g 

in  C G . 

— g 

The  importance  of  these  definitions  becomes  obvious  from 
the  following: 


LEMMA  5 . 3 (Lemma  of  first  variation).  If g Is  a local  bes t 
approximation  to  f in  G,  thei.  g is  a cr i tical  point . 

Proof.  Assume  that  g is  not  a critical  point.  Then  there  is 

a tangent  vector  l^C  G such  that 

g 

C : *0  f-g»-ll  f-g-hll  -0. 

We  compute  the  distance  of  f from  the  elements  of  the  curve 


i 
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given  in  Definition  5.1: 

I f-g^ I <1 f-g-x hi +1 g -g~x hi 

1 f-g-hl  +(  l-> ) I f-gl  +o  (a  ) 

Cl  f-gl  -c> +o(/ )<\  f-gl  , 

provided  that  <■  is  sufficiently  small.  Hence,  g is  not  locally 
optimal.  D 


Note  that  the  main  step  in  the  proof  above  is  nothing 
else  than  an  application  of  Newton's  method. 


The  standard  application  of  the  lemma  is  the  following. 
One  tries  to  find  a linear  subspace  of  C G,  e.g.  by  looking 
for  an  appropriate  parametrization.  Then  the  criteria  from 
linear  theory  are  applied.  In  particular,  in  the  Chebyshev 
case  the  Kolmogorof f-criterion  may  be  used  [ 28) . 


The  tangent  cones  are  linear  sets,  if  G is  a manifold 
without  boundary. 

DEF IN IT  ION  5 . A . A subset  GCR  j s an  m-d imens iona 1 C * -submani- 
fold of  R,  if  for  each  g? G there  is  a neighborhood  UCG  and 
a one-one  mapping  F from  an  open  set  'xT-IR  onto  U.  Moreover , 

F is  assumed  to  be  continuously  Frechet  differentiable  on  W, 
and  d^F  to  be  injective  for  each  af=U . 

It  is  crucial  for  critical  point  theory  that  the  con- 
verse of  Lemma  5.3  does  not  hold  in  general.  But  it  may  be 
proved  for  the  situation  given  in  Theorem  5.6.  At  first  we 
regard  Fig.  3 which  illustrates  two  different  situations  in 
2-space.  In  the  left  figure  g is  a critical  point  to  f^  and 
f,,  since  the  connecting  line  is  orthogonal  to  the  tangent 
line.  However,  g is  a local  best  approximation  only  with  re- 
spect to  fj  and  not  to  f^.  The  right  figure  refers  to  the 
sup-norm.  Then  g is  a critical  point  and  a local  best  approxi- 
mation to  f.  It  is  a local  solution  to  each  element  on  the 
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Fig.  3.  Critical  points,  in  the  situation  of 
Euclidean  norm  and  sup-norm. 


ray  through  f with  center  g. 


To  put  thi s observat ion  into  a rigorous  theory  the  follow- 
ing concept  was  introduced  by  Wulbert  [ 3 7 J . 


DEFINITION  5.3.  A C * -submani fold  GCC ( I ; is  Haar  embedded,  if 
all  tangent  spaces  (tangent  cones)  are  Haar  subspaces  of  C ( l ) 

THEOREM  5.6.  Let  GCC ( I ) be a Haar  embedded  manifold  and  let 

f€C ( I > . Then  g is  a local  best  approximation  to  f i n G , iff 
g is  a critical  point. 


Proof . Because  of  Lemma  5. I it  is  only  necessary  to  prove 
the  "if"-part. 


Assume  that  g^  is  a critical  point  to  f in  G.  Let 
F:W  >G  be  a parame t r i zat ion  sucn  that  F(o)=g  . The  para- 


metrization  induces  a mapping  from  FCW)  into  G 

g 

g *h*dQF(F  *g). 

It  follows  from  the  differentiability  that 


G, 

o 


II  g-g  “hJ “o (I  hi ) . 

By  the  definition  of  critical  pomes  0 is  a best  approxi- 
mation to  f-g  in  the  Haar  subspace  C G.  As  is  known  from 

O y 

6o 
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Linear  theory,  then  0 is  also  a strong  best  approximation. 
This  means  that  we  have  for  some  c^O: 

II  f-g  -hll  ill  f-g  — o I -ec II  hll  , 

o o 

whenever  h is  a tangent  vector.  Combining  both  inequalities 
we  obtain 

I f-gll  ill  f-g  -h#  -II  g-g  -hll 

o o 

i II  f-g  II  -ec  II  hll  -O  (II  hll  ) i I f -gl  , 

if  II  g-g  II  is  sufficiently  small.  0 

o 

Now  we  turn  to  the  development  of  the  global  result. 

As  usual,  for  a € /R  the  Level  set 

cM^GjIlf-gka} 

is  introduced. 

THEOREM  5.7  (Deformation  Theorem;.  Let  G be  a C * -manifold . 

If  the  set  {f^G;  II  f-gll  ^ f is  compact  and  contains  no  criti- 

rj  _ g 

cal  point,  then  G is  a strong  deformation  retract  of  G . 

Outline  of  proof.  Consider  a non-critical  point  g^ . Let  F be 
a parametr izat ion  for  some  neighborhood  U of  gf  in  G.  Since 
g is  not  critical,  there  is  a parameter  vector  b such  that 

g =F  (a  ),  II  f-g  -d  F bMIf-gfl  . 

o o o a 

o 

With  the  same  arguments  as  in  the  proof  of  Lemma  5.3  we  get 

II  f-F (a-t-A b ) II  ■ I f-F(a)H 

for  all  parameters  a in  some  neighborhood  of  a^  and  suffi- 
ciently small  a.  We  may  choose  a cut-off-function  k-  such  that 

r (a  ) ^0  and 
o 

I f-F(a+A<(a)- b)l <1 f-F(a)l , OcA- I , 
whenever  F(a)GU.  Consequently,  by  the  transformation 
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F K 

g ► a ► a+A  K(a)-  b ► g_ 

a llow 

<t>  :l  o , 1 1 * G — > G 

has  been  constructed  such  that 
$(0,g)-g, 

■f-*(A,g)l'  <f-gl  .0- ■ A v!  - 

In  particular,  the  inequality  is  strict  when  g=g  . 

o 

The  proof  of  the  theorem  proceeds  by  covering  the  set 
{gGG,  av.1  f-gH  } by  a finite  number  of  open  sets  such  that 
for  eacli  point  at  least  one  of  the  corresponding  flows  redu- 
ces the  distance  to  f.  By  glueing  the  flows  together  and 
using  compactness  once  more,  we  obtain  for  each  g€Gl  a path 
of  descent,  which  ends  in  Cl.D 

As  a consequence  we  have  | 5,38]  : 

THEOREM  3.8  (Uniqueness  Theorem) . Each  connected,  bounded  ljr 
compact , Haar  embedded  submanifold  of  C(l)  ^s  a Chebyshev  get 

REMARK.  Haar  embeddednes  may  be  considered  as  an  assumption 
on  zeros  with  local  structure.  Assumptions  referring  to  glo- 
bal properties  are  only  connectedness  and  boundedly  compact- 
ness. 

Proof . Assume  that  there  are  two  local  best  approximations 
g and  g in  G.  Since  G is  connected,  they  belong  to  the 

same  component  of  G , if  8 is  sufficiently  large.  The  set  of 

R 

critical  points  is  closed,  hence  compact  if  restricted  to  G . 

Consequt nt 1 y , we  may  choose  two  local  solutions  g^,g.  (which 

3 s 

possiblv  are  and  g^)  in  the  component  such  that  there  is 
no  further  critical  point  between  the  levels  a*raax  {Hf-gJ, 
i “ 3 , 4 } and  8.  However,  the  local  solutions  g^  and  g^  must  be 
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. . . _ Ct  + £ _ . , 

contained  in  distinct  components  of  G , c sufficiently  small. 
This  contradicts  the  deformation  theorem.  0 

6 Approximation  by  sums  of  exponentials 

In  non-linear  Chebyshev  approximation  the  rational  func- 
tions and  the  exponentials  are  the  families  which  are  the 
most  investigated.  Whilst  the  rational  functions  show  a good 
behaviour,  the  treatment  of  exponentials  is  very  involved. 

The  set  E°  of  proper  exponentials  given  in  Example  4.5 
n 

is  not  an  existence  set.  In  order  to  assure  existence  one  has 

to  consider  the  closure  of  E°. 

8 8 t n 8 

(6.1)  En={g=  l Pv(t)e  ; k=  l (l+9pvKn,  B^eTR}. 
v=l  v=l 

Here,  the  p^s are  polynomials  with  degree  3p^.  If  an  (extended) 
exponential  is  represented  as  in  (6.1),  then  8=8(g)  corresponds 
to  the  number  of  distinct  characteristic  numbers,  while  k=k(g) 
is  evaluated  with  counting  multiplicities.  The  first  rigorous 
proof  for  E^  being  an  existence  set  was  given  by  Werner  in 
1963.  The  reader  is  referred  to  E.  Schmidt's  paper  [33]  for  an 
easier  proof. 

As  was  observed  in  1968  [ 2]  the  extended  family  E is  not 

n 

varisolvent  fur  n>_2.  It  is  locally  solvent  of  degree  n+8  (g)  at 

g,  while  Property  Z holds  with  degree  n+k(g).  Note  the  gap 

between  8 and  k,  if  gSE\E°.  Nevertheless,  local  best  approxi- 

n n 

mations  may  be  characterized  by  an  alternating  of  the  error 
curve  [ 8]  . The  number  of  alternatings  lies  between  n+8  and  n+k. 

But  we  have  not  always  uniqueness.  By  using  Meinardus’  in- 
variance principle  examples  for  functions  with  two  best 

approximations  were  constructed  [2].  For  example,  to  f(x)  = 

2-1  ... 

( I +x  ) on  I=[-l,+l]  there  are  two  best  approximations  in  E^ • 

On  the  other  hand  it  was  also  possible  to  prove  with  classical 
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tools  that  there  are  at  most  two  solutions  in  For  several 

years,  however,  it  was  not  known  whether  the  number  of  solu- 
tions in  K , ni3,  is  bounded, 
n 

This  problem  was  attacked  by  using  the  tools  from  global 

analysis  described  in  the  last  section.  Three  years  ago  the 

author  anounced  the  result  that  there  are  at  most  n!  local 

solutions  in  E . Unfortunately,  there  was  still  a gap  in  the 

proof.  Anyway,  in  the  meantime  the  case  n=3  could  by  settled 

I 8|  . The  number  of  local  solutions  in  is  less  or  equal 

than  3,  and  this  bound  is  the  best  possible.  (More  generally, 

the  bound  lor  E cannot  be  less  than  n.)  We  hope  that  within 
n 

the  next  years  the  little  gap  in  the  proof  for  n^4  can  be 
closed  by  the  development  of  appropriate  perturbation  techni- 
ques. 

The  results  on  Haar  embedded  manifolds  may  be  applied. 

But  we  have  to  overcome  two  difficulties. 

At  lirst  t lie  theory  of  Haar  embedded  manifolds  must  be 
extended  to  include  manifolds  with  boundaries.  Then  the  tan- 
gent cones  are  no  longer  linear  spaces  but  proper  cones. 
Fortunately,  this  causes  only  complications,  which  are  techni- 
cal in  nature.  They  are  treatable  though  they  make  the  analy- 
sis a littly  clumsy. 

More  annoyance  is  caused  by  the  fact  that  E is  not  a 

n 

manifold.  This  becomes  already  obvious  Iron:  the  standard  re- 
presentation for  Ej. 


( ot , ft ) ► ote^ 

This  paramet r i zat ion  is  only  one-one,  if  we  contract  the  line 
j (O.ft).ftf  ft}  in  JR2  to  a single  point.  In  the  resulting  para- 
meter set  zero  is  a singular  point.  Note  that  the  singular 
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point  corresponds  to  an  exponential  which  has  not  the  maximal 
order.  If  we  want  to  obtain  a manifold,  then  we  have  to  con- 
sider E \E  , instead  of  E . 
n n- 1 n 

Refering  to  E. Schmidt's  compactness  results  for  exponen- 
tials, we  obtain  from  Theorem  5.8: 


LEMMA  6.1.  Each  component  of  (E  ) , , which  is  disjoint 

from  E | , contains  exactly  one  local  best  approximation. 


Now,  consider  a local  best  approximation  in  E and  the 

n 

components  of  the  corresponding  level  sets.  There  is  a smal- 
lest a such  that  the  level  set  with  level  a intersects  E 

n-  1 

A central  result  of  the  investigation  is  that  the  minimal 

intersection  contains  an  element  which  is  a local  solution 

with  respect  to  E |.  Consequently,  we  can  characterize  the 

solutions  in  E by  those  in  E 

n n- 1 

For  enumerating  the  solutions  in  E via  an  inductive 

n 

process  the  converse  problem  arises:  how  many  solutions  in 

E can  bifurcate  from  any  solution  g in  E , ? To  be  more 
n y n- 1 

• . V Ol 

precise,  consider  the  component  of  the  level  set  (E  ) , 

n 

a = H f-gll  , which  contains  g.  Into  how  many  parts  will  it  be 
splitted,  if  the  level  decreases  by  a small  number  e to 

(E  )a_C? 

n 


Let  us  return  for  a moment  to  the  discussion  of  the  sin- 
gular point  in  Ej.  From  the  viewpoint  of  differential  geometry 
it  is  clear  how  to  cancel  the  singularity.  One  has  to  blow  up 
the  point  to  a 1-dimensional  line.  The  question  arises  whether 
this  blowing  up  can  be  done  consistent  with  the  analysis. 


Indeed,  j*  E^_ j may  be  written  as  an  exponential  in  E^: 


g+O-e 


(it 


where  elements  with  different  E are  not  identified.  Now 

A (St 

descend  from  g+u^e  , u =0,  £S  fixed,  by  Newton  s method  to  a 
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lower  level.  It  turns  out  that  two  choices  3 and  3'  are  equi- 

. . . A , , 

valent  if  no  characteristic  number  of  g is  contained  in  the 
(closed)  interval  with  endpoints  3 and  3'.  Since  g has  at  most 
n- 1 characteristic  numbers  there  are  at  most  n inequivalent 
choices  of  3-  (There  is  still  some  trouble  for  the  neighbor- 
hoods of  the  spectrum  of  g) . By  an  induction  the  bound  n!  for 
the  number  of  possibilities  is  conjectured. 

7 Nonlinear  mean-squares  approximation 

One  ol  the  first  results  on  rational  approximation  with 

respect  of  I, -norms,  I <p<  , was  the  result  of  Cheney  and 

Goldstein  | 15]  that  each  (local)  best  approximation  in  R, 

«■  , r 

has  zero  defect.  Therefore,  one  of  the  complications  in  rati- 
onal Chebyshev  approximation  does  not  happen  here.  The  non- 
uniqueness  result  mentioned  in  the  introduction,  remembers 
us,  however,  that  we  cannot  expect  a nice  theory.  We  will 
return  to  the  non-uniqueness  problem  later. 

At  first  we  will  have  a glance  at  the  general  theory. 
During  tiie  last  years  the  lemma  of  first  variation  (Lemma  5.3) 
was  (re-)  discovered  by  several  authors.  A deeper  analysis  is 
possible  if  we  restrict  our  attention  to  , or  more  generally 
to  a Hilbert  space  H with  inner  product  | , ] . It  turns  out 

that  most  investigations  are  based  on  the  following  analysis 
of  second  order  terms. 

Let  A be  an  open  set  in  n-space,  and  let  F:A  — * G be  a 

parametr izat i on.  Assume  that  the  first  and  the  second  deriva- 
2 . 

Live  d F and  d F in  the  sense  of  Frechet  exist.  Here,  d F is  a 
a a o a 

• , n . . . 
linear  t ransformat i on : R » H and  d^F  is  a bilinear  sym- 
metric form:  TP1'*  TR1'  >11.  If  the  kernel  ker  d F consists  only 

a 

of  the  zero  element,  then  C , ,G=d  F(R  ). 

r ( a ) a 

The  analysis  is  based  on  the  square  of  the  distance  func- 
tion 
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p (a)  =11  f— F (a) II  2=  [f-F  [a]  ,f-F(a)|  . 

The  derivatives  of  p are  easily  derived. 

d p*  - 2 (f-F(a)  ,d  F] 
a a 

1 d2p=[d  F,d  F)  - [ f-F  (a)  ,d  2F| 

2 a a a a 

An  element  g=F(a)  is  a critical  point,  if  d p=0.  Concerning 

a 

second  order  terms  the  following  definitions  are  useful  | 29] . 

DEFINITION  7.1.  Let  F(a)  be  a critical  point.  The  number  of 

2 . . 

negative  eigenvalues  of  d p is  called  its  index,  and  the 
2 . a 

dimension  of  kerd^p  is  called  its  nullity.  A critical  point 
is  degenerate,  if  its  nullity  is  greater  then  zero . 

Note  that  the  classical  criteria  say  the  following:  If 
a critical  point  is  a local  solution,  then  its  index  is  zero. 
On  the  other  hand  each  critical  point  with  vanishing  index 
and  nullity  is  a local  best  approximation. 

While  the  meaning  of  the  index  is  obvious,  it  is  not  so 
quickly  understood  for  the  nullity.  Non-degenerate  critical 
points  are  isolated.  Moreover,  the  location  of  a non-degenera- 
te critical  point  is  a continuous  function  of  f (Compare  the 
problem  of  the  continuity  for  the  metric  projection).  The  most 
important  result  incorporating  nullity  is  obtained  in  connec- 
tion with  a solar  property. 

2 

The  expression  for  d p splits  into  two  terms  in  a natural 

a 

way.  The  term  | d F,d  F]  is  called  the  first  fundamental  form; 
a a 2 

the  other  one  [F(a)-f,d  F]  is  the  second  fundamental  form. 

a 

If  Fla]  is  a critical  point  to  f,  then  it  is  also  criti- 
cal, when  the  element 

fA«F(a)+X(f-F(a)),  A>0 

is  to  be  approximated.  The  corresponding  second  derivative  is 
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[d  F,d  F|-A[  f-F(a),d2Fl  . 
a a a 

Hence,  only  the  second  fundamental  form  depends  on  X.  There- 
fore, the  determination  of  the  degenerate  situations  corres- 
ponds to  an  eigenvalue  problem.  If  F(a)  is  a degenerate  cri- 
tical point  to  f , then  f is  called  a focal  point. 

LEMMA  7.2.  (Morse's  index  theorem)  The  index  of  a non-de- 
generate critical  point  g is  equal  to  the  number  of  focal 
points  which  lie  on  the  segment  from  f _to  g ; each  focal  point 
being  counted  with  its  multiplicity . 

The  index  vanishes  for  sufficiently  small  X.  This  corres- 
ponds to  the  fact  that  the  curvature  (which  is  defined  by  the 
n eigenvalues  mentioned  above)  is  finite. 

The  situation  for  nonlinear  uniform  approximation  is 
different.  Chui  and  Smith  [ 1 6 1 observed  recently  that  a cri- 
tical point  in  a smooth  manifold  need  not  be  a local  solution, 
even  if  the  element  to  be  approximated  is  arbitrarily  close. 
Another  example  is  given  in  Fig.  4.  In  the  lignt  of  Rice's 
definition  of  (a  scalar)  curvature  [ 32]  , one  might  say  that 
the  curvature  is  infinite.  But  this  definition  has  a serious 
disadvantage:  the  curvature  is  here  not  an  intrinsic  property, 


Fig.  4.  Example  for  critical  points  which  are 
not  local  best  approximations,  when 
the  tangent  line  does  not  satisfy  the 
Haar  condition. 
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The  splitting  of  the  second  derivatives  into  the  two 
fundamental  forms  is  also  used  for  the  proof  the  following 
theorem  which  was  first  given  by  Wolfe  [ 36]  for  the  case 
8 <r . 

THEOREM  7.3.  Assume  that  p./q-SR^  \R^  _ ^ r_  | > 3q.=r, 

i=l,2,...,N,  are  such  that  q.  and  q.  have  no  common  factors 

! J ; 

unless  i=i.  Then  there  area  f£L„  and  N elements  g.GR,  which 
2 — l *-  , r 

differ  from  p^/q^  by  a polynomial  and  which  are  local  best 
approximations  to  f . 

REMARK.  One  has  g.=p./q.  if  8^r. 

1 L 1 

Sketch  of  proof . The  main  idea  of  the  proof  is  to  solve  the 
equations 

[f-gi,  da  F]  = 0 -j 

\ ) i=l,2,...,N. 

[ f-g.,  da  F]  = 0 \ 

i 

The  first  set  of  equations  assures  that  all  g^'s  are  critical 
points,  while  the  second  set  causes  that  the  second  fundamen- 
tal forms  vanish.  Since  the  first  fundamental  form  is  defi- 
nite, all  g.'s  are  local  best  approximations. 

2 

The  derivatives  d F and  d F do  not  change,  if  a poly- 

cl  2L 

nomial  is  added.  Consequently,  the  equations  above  are  linear 
equations  for  f and  (g^-p^/q.),  i=l,2,...,N.  The  existence  of 
a solution  is  established  after  expressing  the  derivatives  in 
terms  of  appropriate  rational  functions.  0 

From  the  theorem  it  follows  immediately  that  there  is  no 
uniform  bound  for  the  number  of  local  best  approximations  in 
R^  ^ . Therefore,  the  situation  is  even  worse  than  in  the 
Chebyshev  approximation  by  exponentials. 
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8 Approximation  by  interpolation 

Up  to  now  we  have  always  considered  the  concept  of  best 
approximation.  One  can  also  try  to  find  a good  approximation 
by  interpolating  at  some  points.  However,  there  does  not  al- 
ways exist  a solution  when  one  wants  to  interpolate  by  rati- 
onal functions,  by  exponentials  or  by  spline  functions  with 
free  knots.  Curiously,  the  conditions  which  garantee  existen- 
ce, are  very  similar  in  the  three  examples. 

At  first  let  us  focus  our  attention  to  rational  interpo- 
lation [6].  The  following  interpolation  problems  was  already 

2 

considered  by  Cauchy  in  ld23.  Given  6+r+l  points  (t.,y^)G/R  , 

i=0, 1 , . . • ,8 +r,  the  function  p/q€  R is  an  interpolant,  if 

y » r 

p(t. ) 

=y.,  i=0, 1 ,...,£  +r . 

q(t.) 

If  p/q  interpolates,  then  we  have 

p(t  J-y  ,q(t . )=0,  i=0,l  , . . . ,8+r. 

This  system  of  (ft+r+1)  linear  homogeneous  equations  for 
(8+r+2)  polynom-coeff icients  has  always  a solution.  But  the 
solution  may  be  only  a formal  solution.  It  may  have  poles 
between  the  given  t^'s. 

The  following  result  generalizes  a result  of  Lowner  [26]. 

THEOREM  8.1.  Let  S^r-1  and  1=1-1, + 1]  . Assume  that  fSC(I)  has 
an  integral  representation 

+1  C-r+1 

f(t)=p(t)+  / du(t), 

-1  I — t x 

where  p is  a polynomial  of  degree  8-r  and  du  is  a non-nega- 
tive measure  with  infinite  carrier.  Then  given  t .£ 1 , 
i=0,  1 , . . . ,8  +r , there  is  a r which  interpolates  f at  the 
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given  points.  Moreover,  the  interpolation  error  is  pointwise 
smalier  when  compared  with  the  interpolation  by  polynomials 
of  degree  (6+r). 

For  exponentials  or  spline  functions  with  free  knots 
similar  results  are  obtained  if  the  (iterated)  Cauchy  kernel 
tm(l-xt)  * in  the  representation  is  replaced  by  the  exponen- 
tial kernel  exp(tx)  or  the  truncated  power  kernel  (t~x)^, 
resp.  The  connection  of  the  interpolation  problems  with  moment 
problems  is  very  obvious  in  the  treatment  of  the  exponential 
case  in  [ 22) , while  the  approach  in  [21]  shows  that  homotopy- 
techniques  are  appropriate  tools,  when  the  abscissa  are  not 
equally  spaced. 
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RECENT  RESULTS  ON  PACE  APPROXIMANTS  AND  RELATED  PROBLEMS 

Charles  K.  Chui 

The  purpose  of  this  paper  is  to  study  the  recent  develop 
ment  of  certain  aspects  of  Pad6  approximants  and  related 
problems.  For  instance,  convergence  results  of  the  approxi- 
mants and  their  connection  with  best  local  approximation  will 
be  discussed. 


1 . Introduction. 

The  subject  of  Pad£  approximants  is  fairly  old.  It  date 
back  to  as  early  as  Cauchy  (1789-1857)  and  Jacobi  (1804- 
1851)  but  was  first  treated  in  detail  by  Frobenius  in  1881 
[63].  Under  the  influence  of  Hermite  (1822-1901)  at  the 
Ecole  Normale  Superieure,  Pad6,  in  his  1892  dissertation 
[127],  classified  these  rational  fraction  approximants  (now 
known  as  Pad6  approximants),  arranged  them  in  a table  (which 
is  now  called  the  Pad£  table)  and  studied  the  structure  of  it 
in  detail.  In  those  early  days,  Pad6  approximants  were  con- 
sidered as  another  form  of  continued  fraction,  which  was 
already  a well  established  subject.  Indeed,  if 


f(z)  = bQ  + 


1+ 


V 

a2z 


= c^  + c-^z  + c2z  + 


1 + . 


and  Pq(z)  = b0>  P^z)  = b0  + a^z  , P2(z)  = b^  + a^/U+a^)  , 
. . . are  the  truncations  of  the  above  continued  fraction, 
then  the  even  ones  P^,  P2>  P^,  ...  occupy  the  main  diagonal 
and  the  odd  ones  P^ , P^,  P,.,  . . . march  down  the  [n+l/n] 
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diagonal  of  the  Pade  table  of  the  formal  power  series 

2 

Cq  + c^z  + c^z  + . . . . References  to  the  subject  of  Pad4 

approximants  and  continued  fractions  can  be  found  in  the  books 
Baker  [7],  Balk  [12],  Luke  [105,  106],  Perron  [131]  and  Wall 
[157],  and  Proceedings  of  conferences  [10],  [84]  and  [92],  as 
well  as  the  articles  by  Gragg  [81]  and  Wynn  [175]. 

During  the  early  1960's,  physicists  found  that  Pade 
approximants  are  very  useful  as  a systematic  method  of  ex- 
tracting more  information  from  (formal)  power  series  expan- 
sions, and  the  method  has  been  especially  successful  in 
critical  phenomena.  In  scattering  theory  and  quantum  field 
theory  where  series  of  Stieltjes  occur  frequently,  the 
approximants  are  known  to  converge  to  the  expected  solutions 
rapidly.  However,  when  the  series  do  not  have  the  "positive 
character",  much  still  has  to  be  understood.  The  main  contri- 
butors in  this  direction  include  the  physicists  or  chemists 
Baker,  Bessis,  Chisholm,  Fisher,  Gammel,  Nuttall,  Wheeler, 
and  others.  The  interested  reader  should  consult  the  articles 
listed  in  the  references.  It  should  be  noted,  however,  that 
hundreds  of  papers  related  to  Pad6  approximants  have  been 
written  during  the  past  twelve  years,  and  Graves-Morris  of 
the  Mathematical  Institute,  University  of  Canterbury,  Kent, 
England,  is  preparing  a bibliography. 

In  the  next  section,  we  will  discuss  the  definition  of 
Pade  approximants  and  several  generalizations.  The  third 
section  of  this  paper  will  be  devoted  to  the  important  question 
of  convergence.  In  the  fourth  or  final  section,  the  related 
problems  of  best  local  approximation  and  best  quasi-rational 
approximation  will  be  treated. 
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2 . The  Pad 6 approximants . 


The  definition  of  Pad£  approximants  is  a natural  exten- 
sion of  that  of  the  Taylor  series.  Let 

no 

(2.1)  f(z)  = l c zk 


be  a formal  power  series  with  comp]ex  coefficients  c^,  and 

let  m and  n be  non-negative  integers.  The  [m/nj  Pade 

approximant  of  f (at  the  origin)  is  the  unique  rational 

function  [m/n]  = [m/n](f)  = p /q  where  q / 0 and 

m n n 


p (z)  = / a,  z and  q (z) 

‘ m Ir  n 


).  bL  Z 


k=0 


k=0 


such  that 

f (z) q (z)  - p (z)  = d Zrn+T1+^  + higher  order  terms, 
n m 

The  existence  of  p and  q follows  immediately  from  con- 

m n 

sidering  the  system  of  equations 


i 

(2.2)  ) c 

k=0 

j-k 

_ ) 

1 

b 
1 0 

for  j 
for  j 

= 0, . . . , m 
= m+1 , . . . ,nrt-n 

(where  we  set  b. 

k 

0 

if 

k > n) , 

and  the  uniqueness  of  the 

ratio  p /q 
m n 

is  an 

easy  exercise. 

Hence,  the  following 

table,  which 

is  called 

the 

Padd  table,  is  obtained: 

[0/0] 

[ 1/0  l 

[2/0] 

• • 

[0/1] 

[1/1] 

[2/1] 

• • • 

[0/2] 

[1/2] 

[2/2] 

• • • 

1 


It  is  clear  that  the  first  row  is  actually  the  sequence  of 
partial  sums  of  the  formal  series  f itself.  We  include 
some  other  useful  equivalent  definitions  in  the  following 
theorem . 
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THEOREM  2.1.  Let  f be  the  formal  power  series  (2.1)  and 

let  f , be  the  (m+nf*1  partial  sum  of  f.  Then  each  of  the 

following  statements  is  equivalent  to  the  definition  of 

Pad6  approximants  [m/n]  = p /q  of  f: 

“ m n — 

(a)  f(z)q  (z)  - p (z)  = s zU  + higher  order  terms,  where 

n m — “■ 

p < 00  is  as  large  as  possible  and  q^  { 0. 

(b)  f(z)  - p (z)/q  (z)  = t z + higher  order  terms,  where 

m n — 

v 1 “ is  as  large  as  possible  and  q^  t 0. 

(c)  (f  q - p ) ^ (0)  = 0 for  j = 0,...,  m+n  , and 

m+n  n m 

q tO. 
n 


Statement  (b)  can  be  found  in  Pad6  [127].  We  remark 
that  v may  happen  to  be  smaller  than  m+n+1,  although  it  is 


clear 

that  p > 

m + n + 1 . 

However,  if  the 

series 

(2.1) 

normal 

(cf.  Frobenius  [63] 

),  that  is,  if 

c 

u 

Cu-1 

' ' Cu-v+l 

(2.3) 

CU+1 

c 

u 

‘ ' Cu-v+2 

* 0 , 

cu+v-l 

Cu+v-2  ' 

. . c 

u 

where 

c 

: = 0 

w 

if  w < 0, 

for  all  u 

and 

O 
A 1 
> 

then 

it  can 

be  shown 

that  v = 

m+n+1  also. 

The  normality 

condition  is  a very  important  one  in  the  theory  of  Pade 

approximants;  for  example,  when  f is  normal,  the  fractions 

p /q  are  in  irreducible  form  for  all  m and  n,  so  that 
m n 

the  Pad4  table  of  f has  no  blocks  of  order  larger  than  one 

(cf.  Gragg  [81]).  If  f is  a function  of  continuity  class 
m+n 

C at  the  origin,  then  we  can  use  (c)  as  the  definition 
of  the  [m/n]  Pad£  approximant  of  f as  in  Chui,  Shisha  and 


Smith  [34]  by  replacing  f 


m+n 


by  f in  (c),  namely, 


(2.4)  (f  q - p ) 
n m 


(j) 


(0)  = 0 , j = 0,...,  m+n. 
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We  call  (2.4)  the  Pad£  equations  in  [34],  One  advantage  of 
this  definition  is  that  we  do  not  have  to  write  down  the 
Taylor  expansion  of  f at  0. 

It  is  intuitively  clear  from  the  definition  of  Pade 
approximants  that  they  are  good  approximants  at  the  origin. 
It  is  also  clear  that  they  are  quite  computable  if,  for 
example,  the  coefficients  c^  of  the  formal  power  series 
are  known  (or  can  be  estimated).  Many  algorithms  have  been 
written;  we  only  mention  Claesson  [37]  and  Gragg  [81]  and 
refer  the  reader  to  the  references  at  the  end  of  this  paper. 

Next,  we  mention  several  important  generalizations  of 
the  Pad£  approximants: 

For  each  k,  we  let  t be  a polynomial  of  degree 
k and  leading  coefficient  equal  to  1,  so  that  t r 1, 
and  consider  the  formal  polynomial  series 


(2.5)  f(z)  = l c.  t (z). 

k=0  K k 


Suppose  that  there  exist  polynomials  p and  q , q t 0 , 

m n n 

with  degrees  m and  n respectively,  such  that 


(2.6)  f(z)q  (z)  - p (z)  = c t (z)  + d t „(z)  + • • 

n m m+n+1  m+n+2 

we  say  that  p /q  is  a generalized  fm/n]  Pad6  approxi- 
m n 

mant  of  f.  The  difficulty  in  proving  existence  here  is 
that  we  do  not  have  the  simple  relationship  t t = t 


m n 


m+n 


(as  in  the  case  of  a formal  power  series  with  t (z)  = zn) , 

n 

so  that  the  equations  (2.2)  are  not  satisfied. 

For  instance,  let  (6.)  , k = 1,2,...,  be  a preassigned 
sequence  of  (interpolation)  points  in  the  complex  plane 
and  let 


(2.7)  t^(z)  - t^(z)  = ^ (Z_B.)  for  k>  1 

j = l 
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Then  (2.5)  is  called  a formal  Newton  series.  It  can  be  proved 

in  this  case  (cf.  Gallucci,  Jones  [64],  Karlsson  [95,  96]  and 

Warner  [164,  165])  that  polynomials  p^  and  which 

satisfy  (2.6)  exist  and  that  the  ratio  p /q  is  unique. 

m n 

Here,  p /q  is  called  the  [m/n]  Newton-Pad4  approximant 
m n 

of  f in  (2.5).  In  the  special  case  if  the  interpolation 
points  6^  are  distinct,  and  the  Newton  series  (2.5)  con- 
verges to  f uniformly  on  a set  containing  the  points  8^ , 

then  p / q is  the  "best"  interpolating  rational  function 

m n 

of  f at  (8  } of  degree  (m,n)  in  the  sense  that  if 

r is  any  rational  function  of  degree  (m,n)  such  that 

m , n 

r interpolates  f in  at  least  the  same  subset  of 

m,n 

(B 8 , as  p /q  then  we  have  r I p /q  . 

1 nrt-n+1  m n m,n  m n 

The  more  general  case  when  we  have 


tn(z)  1,  t (z)  = II  (z-B.  ')  for  k > 1 , 

U * j = l J 

(k) 

where  (6.  } , j = l,...,k  , k = 1,2,...,  are  points  in 

the  complex  plane,  has  also  been  studied  by  Karlsson  [95, 

96],  and  Warner  [164,  165].  In  this  case,  assuming  f to 

be  of  continuity  class  in  an  "open"  set  containing 

(k) 

the  points  {8.  } , j =l,...,k  , 1 <k<m  + n + l, 

we  have  to  define  p^  and  q^  in  a slightly  different  way, 
namely,  instead  of  (2.6),  they  should  satisfy 

(2.8)  (f  q - p ) (Bfk))  = 0 , q t 0,  j = l,...,k  , 
n m j n 

and  k = 1 , . . . ,m+n+l , 


where  for  each  k,  derivatives  of  order  p , p = 0,...,v-l  , 

(k) 

have  to  be  taken  if  v of  the  points  8.  coincide.  Of 

(k)  1 

course,  if  all  of  the  8.  are  equal  to  0,  then  (2.8) 
becomes  the  set  of  Pad4  equations  as  in  (2.4).  We  call 
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p /q  the  best  interpolating  rational  function  of 

m n (k) 

degree  (m,n)  to  f at  {g^  } (or  an  (m+n+1 ) -point  Padd 

approximant) . 

Now  suppose  that  the  polynomials  t^  are  normalized 
Chebyshev  polynomials,  that  is 


-k+1  -1 

t^  = 1,  t^(x)  = ^ cos  k cos  x , x e [-1,1],  k > 1. 


Then  (2.5)  is  called  a formal  Chebyshev  series.  In  this  case, 
existence  and  uniqueness  of  the  generalized  Padd  approximants 
have  been  studied  by  Clenshaw  and  Lord  [40]  and  Maehly  [109]. 
Also,  Gragg  and  Johnson  [83]  rewrote  the  Chebyshev  series 
as  a formal  Laurent  series  and  hence  studied  the  so-called 
Laurent-Padd  approximants.  Other  formal  series  of  orthogonal 
polynomials  have  been  considered  by  Holdaman  [89]. 

Another  natural  and  at  least  computationally  useful 
generalization  is  to  consider  formal  power  series  of  two 
variables.  The  research  in  this  direction  is  quite  recent. 

It  has  been  initiated  by  Chisholm  [30]  in  1973,  and  then 
extended  in  1^74  and  1975  by  the  Canterbury  group  (cf.  [86], 
[133]  and  the  references  there).  The  approximants  are 

now  called  Canterbury  approximants.  The  idea  of  this 
extension  is  that  if  one  of  the  two  variables  is  kept 
constant,  then  one  should  get  the  one-variable  Padd  approxi- 
mants. 

The  concept  of  Padd  approximants  can  also  be  generalized 

in  the  following  way.  Let  f ,...,f  (N  > 2)  be  formal 

1 N — 

power  series  (of  the  same  variable)  as  in  (2.1).  Determine 
polynomials  p , of  degrees  n^,...,n^  respectively, 
such  that  not  all  of  these  polynomials  are  identically 
zero  and 
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n +. . .+n  +N-1 

(2.9)  + ...  + fN(z)pN(z)  = a z 1 + 

higher  order  terms. 

This  problem  was  first  posed  by  Pad£  [129]  in  1894,  and  later 
has  been  studied  in  detail  by  Jager  [90]  and  de  Bruin  [26]  in 
their  1964  and  1974  Amsterdam  theses  respectively.  Of 

course  if  N = 2 and  f = -1,  (2.9)  becomes  the  original 

1 2 
Pad6  problem.  Let  N = 3,  f^  = 1,  = f and  f = f ; 

then  we  obtain  the  quadratic  Pade  approximants  as  considered 
by  Shafer  [144].  Joyce  and  Guttmann  (cf.  Gammel  [67]) 
recently  considered  the  special  case  where  f = f, 
f2  = f',...,fN  = f^N  This  special  case  is  important 

in  physical  problems  concerning  critical  phenomena,  and  it 
is  especially  important  when  N = 3,  since  for  the  case 
where  p^,  p^>  are  restricted  to  polynomials  of  degrees 

6,  8 and  9 respectively  it  is  related  to  the  Onsager  solu- 
tion for  the  free  energy  of  an  Ising  model  on  a square 
lattice . 

Other  important  generalizations  include  operator  Pade 
approximants.  An  interested  reader  should  consult  the 
articles  listed  in  the  references. 

3.  Convergence  of  Pad4  approximants. 

The  problem  of  convergence  of  a certain  sequence  of 
Pad£  approximants  is  in  general  a very  difficult  but  cer- 
tainly interesting  and  important  one.  The  choice  of  a 
suitable  sequence  from  the  Pad6  table  depends  very  much  on 
the  analytic  character  of  the  function  that  the  (formal)  power 
series  represents.  As  one  might  expect,  the  convergence 
proofs  usually  rely,  either  directly  or  indirectly,  on 
some  knowledge  of  the  location  of  the  poles  of  the  approxi- 
mants; and  finding  information  on  the  location  of  the  poles 
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is  a very  important  unsolved  problem  in  the  theory  of  Padd 
approximants . We  divide  our  discussion  of  the  convergence 
problem  into  the  following  four  categories. 

I.  Convergence  along  rows . For  a fixed  positive  integer  j, 
the  j*''1  row  on  the  Padd  table  is  the  sequence  { [n/j  — 1] } , 
n = 0,  1,  2,  ...  . Hence,  if  f is  a holomorphic  function 
in  the  disc 

Ar={z:|z|<R},  R > 0 , 

then  the  first  row,  being  the  sequence  of  partial  sums  of 
the  Taylor  expansion  of  f at  0,  certainly  converges 
uniformly  on  every  compact  subset  of  A to  f.  Suppose 
again  that  f is  holomorphic  in  A . The  problem  of 
convergence  along  other  rows  is  much  harder.  In  fact, 

Perron  [131;  p.  270]  gave  an  example  of  an  entire  function  f 
such  that  the  poles  of  the  [n/1]  Padd  approximants  of  f 
form  a dense  set  in  the  complex  plane.  Hence,  one  has  to 
go  to  subsequences.  In  [15],  Beardon  indeed  proved  that 
if  f is  holomorphic  in  A , then  there  is  a subsequence 
of  the  sequence  of  [n/1]  Padd  approximants  that  converges 
uniformly  on  every  compact  subset  of  AD  to  f.  Very 
recently,  Baker  and  Graves-Morris  [11]  proved  that  the 
third  row  [n/2]  has  the  same  property.  But  nothing  else 
is  known  for  the  other  rows.  However,  if  f has  a finite 
number  of  poles  in  A then  there  is  the  following  famous 
classical  result  of  de  Montessus  [119]. 

THEOREM  3.1  Let  f be  holomorphic  in  A„  except  for  N 

poles  z1 zN  with  a total  multiplicity  M > N such 

that  z1 zN  ? 0.  Let  AR  = AR\{z  ,. . . ,zN>.  Then  the 

sequence  { [n/M] } , n = 1,2,...,  of  Padd  approximants  of 
f converges  uniformly  on  every  compact  subset  of  AR  to  f 
as  n -*•  °°  . 
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Wilson  [168]  has  extended  the  above  result  to  the  con- 
vergence along  the  (M  + p)1"*1  rows  (M  > 1)  for  certain 
"smooth"  functions  f which  even  include  the  cases  when  f 
may  have  non-polar  singular  points  on  the  boundary  of  A 

R 

(cf.  [7,  p.  143]).  The  result  of  de  Montessus  was  also 
proved  for  (n+M+1)  - point  Pade  approximants  by  Saff  [138], 
which  lead  to  the  recent  work  of  Karlsson  [94,  95,  96]  and 
Warner  [164,  165]  on  the  general  interpolation  problem  as 
mentioned  in  section  2.  As  a consequence  of  Theorem  3.1, 
it  is  not  difficult  to  show  (cf.  Wallin  [158])  that  if  f 
is  holomorphic  in  A except  for  infinitely  many  poles  at 
K = { , . . . } , such  that  0 t K , then  there  exist  sub- 
sequences (m^  ) and  (n^i  t*ie  sequence  of  positive 

integers  such  that  the  [m^/n^]  Pade  approximants  of  f 
converge,  as  j ■*"  ® , uniformly  on  each  compact  subset  of 
Ar\K  to  f. 

I I . Convergence  along  the  main  diagonal . 


The  problem  of  convergence  of  the  sequence  {[n/n])  of 
Pade  approximants,  which  occupy  the  main  diagonal  of  the  Pade 
table,  is  especially  interesting.  However,  except  in  the 
case  of  a series  of  Stieltjes  (and  related  ones)  it  has  been 
difficult  to  obtain  satisfactory  results.  We  first  state 
the  following  conjecture  given  by  Baker,  Gammel  and  Wills 
(cf.  [6;  p.  23  - p.  24]). 

CONJECTURE  3.1  Lei  f be  holomorphic  in  A = (z  : |z|  < ll 
except  for  a finite  number  of  poles  in  0 < \z\  < 1 and 
except  for  z = 1 where  f is  continuous  relative  to  A ^ . 
Then  there  exists  a subsequence  of  the  sequence  of  [n/n] 

Pade  approximants  of  f that  converges  to  f uniformly  on 
every  compact  subset  of  A^\k,  where  K _is  the  set  of  poles 
S2±  f . 
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No  proof  and  counterexamples  to  this  conjecture  are 
known.  Perhaps  the  first  breakthrough  in  convergence  along  the 
main  diagonal  is  the  following  result  of  Nuttall  [121]. 

THEOREM  3.2.  Let  £ be  any  fixed  integer . _If  f is  a 
meromorphic  function  in  the  whole  complex  plane  and  does  not 
have  a pole  at  the  origin,  then  the  sequence  { [n  + £ /n ] } , 

n = 1,2,...,  of  Pade  approximants  of  f converges  in 
measure  to  f as  n -*•  00  . 

Pommerenke  [132]  extended  the  above  result  of  Nuttall 
to  convergence  in  capacity  for  a slightly  larger  class  of 
functions,  namely  those  holomorphic  in  the  whole  complex 
plane  except  for  a set  of  capacity  zero  which  does  not  con- 
tain the  origin.  Pommerenke  considered  sequences  [m/n]  with 
c‘*'n<m<cn,c>l.  Recently,  Goncar  ([79]  and  cf.  [95]) 
has  generalized  the  results  of  Nuttall  and  Pommerenke  still 
further.  Another  important  result  is  the  following  theorem 
due  to  Wallin  [159]. 

THEOREM  3.3.  Let  a > 0 and  (n^)  be  a subsequence  of  the 
sequence  of  positive  integers.  Suppose  that  f(z)  = Z a^z~* 
is  holomorphic  in  A„  and  satisfies 

00  a/n, 

(3.1)  £ (max  { | a . | : n,<  j < 2n,}  ) < 00  . 

k=l  j k - k 

Then  the  sequence  {[n^/n^]}  of  Padd  approximants  of  f 
converges  uniformly  on  each  compact  subset  of  D\E  tio  f 
as  k ->  00  , where  E is  a set  of  a - dimensional  Hausdorff 
measure  zero. 

In  the  above,  E is  a set  such  that  given  any  e > 0, 
it  can  be  covered  by  the  union  of  countably  many  discs  with 
radii  r^  such  that  Z r”  < t.  For  a = 2,  E is  a set 
of  (two-dimensional)  Lebesgue  measure  zero.  We  remark  that 
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the  above  theorem  was  actually  stated  for  a sequence  [m^/n^] 
in  [159]  under  a little  different  condition,  and  that  the 
theorem  is  sharp  in  the  sense  that  for  each  a > 0,  and 
any  set  E with  a - dimensional  Hausdorff  measure  zero 
not  containing  the  origin,  there  exist  an  entire  function  f 
and  a sequence  (n^)  such  that  (3.1)  is  satisfied  and  such 
that  the  sequence  of  Pade  approximants  [n^/n^]  of  f is 
divergent,  and  even  unbounded,  everywhere  on  E.  We  take 
a > 2,  then  since  the  whole  complex  plane  is  of  a - 
dimensional  Hausdorff  measure  zero,  We  obtain  the  following 
corollary. 

COROLLARY  3 . 1 There  is  an  entire  function  f such  that 
the  sequence  of  [n/n]  Pad4  approximants  of  f is 
divergent  everywhere  in  the  whole  complex  plane  except  at 
the  origin. 

As  mentioned  previously,  the  difficulty  in  a conver- 
gence proof  is  that  it  is  in  general  not  easy  to  have  any 
control  of  the  poles  of  the  approximants.  In  fact,  Wallin 
[159]  has  proved  that  for  any  sequence  of  points  z^  t 0, 
k - 1,2,...,  in  the  complex  plane,  and  of  integers  n^  with 

nk+l  > 

[n^/n^]  Pad6  approximant  has  a pole  at  z^,  k*l,2, ....  Perhaps 
the  following  theorem  due  to  Jones  and  Thron  [93]  which 
relates  the  convergence  and  boundedness  of  a sequence  of 
Pad4  approximants  is  of  some  interest. 

THEOREM  3.4  Let  {m^}  and  (n^)  be  sequences  of  non- 

m. 

negative  integers  such  that  ft  K < « for  some  e , 

0 < e < 1.  Let  f be  a formal  power  series  with  Pad£ 
approximants  [m^/n^]  and  D a domain  containing  the 
origin . Then  the  sequence  {[m^/n^]}  converges  uniformly 
on  each  compact  subset  of  D if  and  only  if  for  al 1 


2n^,  there  exists  an  entire  function  for  which  the 
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sufficiently  large  k,  the  approximants  [m^/n^]  are 
uniformly  bounded  on  every  compact  subset  of  D. 

For  a series  of  Stieltjes  which  will  be  discussed 
next,  we  do  have  some  control  of  the  poles  (and  actually 
know  where  they  lie)  for  certain  approximants. 

III.  Series  of  Stieltjes.  The  only  simple  general  class 
of  formal  power  series  whose  "diagonal"  Pad6  approximants 
are  known  to  converge  nicely  is  the  class  of  series  of 
Stieltjes.  A formal  power  series 


(3.2)  f (z)  = l c zj 

J-0  J 

is  called  a series  of  Stieltjes  if  there  is  a real-valued, 
bounded,  nondecreasing  function  p assuming  infinitely  many 
values  on  [0,  <*>)  such  that 


(3.3)  c = r tjd  p(t)  , j = 0,  1,  2 

J 0 

It  was  Stieltjes  [148]  who  first  proved  that  the  moment 
problem  on  [0,  °°)  associated  with  the  sequence  {c^} 
determinate  if  and  only  if  the  corresponding  continued 
fraction  expansion  of  f converges  everywhere  in  the  com- 
plex plane  except  on  [0,  <*>) . Since  the  truncations  of  the 
continued  fraction  expansion  of  f are  the  [n/n]  or 
[n+l/n]  Pad4  approximants  of  f (cf.  section  1),  it 
follows  that  if,  say, 


(3.4)  l c ~1/2j  = co 

J-l  J 

(cf.  Carleman  [30];  for  instance,  Cj  = 0 ( ( 2 j ) ! R^)  for 
some  R > 1) , then  each  of  the  sequences  {[n/n]}  and 
{[n+l/n]}  converges  to  the  Stieltjes  integral 
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(3.5) 


00 


du(t) 
1 - zt 


uniformly  on  every  compact  set  in  the  complex  plane  which  is 
disjoint  from  [0,°°),  the  positive  half  of  the  real  axis.  This 
result  can  also  be  obtained  by  using  determinant  theory 
[7,  11],  Schwinger  variational  principle  [1,  3],  theory  of 
orthogonal  polynomials  [97],  or  by  just  observing  that  the 
[n+  £/n],  £ > -1  , Pad6  approximants  are  certain  Gaussian 
quadratures  of  the  integral  (3.5)  (cf.  [2]).  We  summarise 
some  useful  explicit  results,  which  also  give  error  estimates, 
in  the  following  theorem  (cf.  [3,  97]). 


THEOREM  3.5 
with  measure 
Suppose  that 

L (z)  5 
n 


Let  f(z)  = E c zJ  be  a series  of  Stieltjes 
dp  as  described  above  and  let  £ > - 1 . 


"n  ,£ 


(z>  = l 

j=0 


b . z~ 

n.J 


are  the  orthogonal  polynomials  with  respect  to  {[0,°°)  ; 

t+1 

t du(t)}  with  leading  coefficients  b = b , zeros 
“ — - n,n  n 

at  x . and  Christoffel  numbers  (or  Gaussian  weights) 

— n,k  

a ,k=l,...,n.  Then  the  [n+£/n]  Pad£  approximant  of 
n y K 

f is  given  by 


P .p(z)  l , p.->  n 

[n+£/n]  (z)  = — = £ c.  z1  + z £ 

Q (z)  j=0  J k=l  1 " z xn,k 


n,k 


which  is  the  Gaussian  quadrature  approximation  to 

~ £+1 


£ 

l cj 

j=0  J 


,1  . £+1 

z + z 


ir  c-  dp(t)  = f dp(t) 
j i - zt  y i - zt  ’ 
0 0 
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where 


n-1  ^ n 

P .0(2)  = y Z y b 0 . , c and 

n +£  . Lr.  . L . n , 2n-j-k  n-j  ’ 

k=0  j=n-k  J 


Q (z)  = l b , z 
n k=0  n’n'k 


The  error  is  given  by  the  following  formulae 


00 

En(z)  5 E„ ,l(z)  5 / - ln+£/n|  (z> 


r ^ 

uh  1 - 2t 


dp  (t) 


2n+£+l  f L (t)  t 

z 1 n 

n L (z_1)  Jo  1 ~ zt 


n+£+l 


dp(t) 


00  2 J 

( "n  (0  1 

-!)  J 1 - zt 


Ln2(Z_1) 


dp  (t) 


The  poles  of  [n+£/n]  are  simple  and  located  on  [0,<=°)  ; 
and  on  0),  we  have 


(-1)  { [n+^+l/n+1 ] - [n+£/n]}  > 0 , 

(-1)  {[n+^+l/n-1]  - [n+£/n]}  < 0 , and 

£+1 

(-1)  E > 0. 
n 

Hence,  the  convergence  result  follows  immediately  if 
l > -1.  Analogous  results  for  the  case  £ < -1  seem  to  be 
unknown.  From  the  above  theorem,  we  only  have  convergence 
along  the  diagonals  [n+£/n],  £ > -1  , parallel  to  the  main 
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diagonal  of  the  Pad£  table.  From  a recent  communication  by 
Pommerenke,  we  learnt  that  Stahl  [147]  has  recently  proved 
that  if  m = c n,  where  c is  some  positive  constant,  then 
the  sequence  of  [m/n]  Padd  approximants  of  a series  of 
Stieltjes  off  whose  corresponding  measure  dp  has  support 
on  a bounded  interval  [a,b],  converges,  as  n -*■  °°  , 
uniformly  on  each  compact  set  disjoint  from  the  following 
set 


If  the  series  (3.2)  comes  from  an  integral 


dp  (t) 
1-zt 


where  dp  > 0 and  C is  an  arc  in  the  complex  plane, 
analogous  convergence  results  can  still  be  obtained  (cf. 
Nuttall  [125]).  Recently,  Nuttall  ([126]  and  private  communi- 
cation) has  obtained  results  in  a much  more  general  case 
where  the  weights  are  allowed  to  be  complex-valued. 


IV.  Convergence  for  certain  functions.  As  mentioned  above, 
for  series  that  are  not  those  of  Stieltjes,  or  related  series 
considered  by  Nuttall  and  others,  it  has  been  difficult  to 
obtain  general  convergence  results.  In  Balk  [12; 
p.  249  - p.  257]  and  Perron  [131  ; p.  246  - p.  248],  conver- 
gence results  have  been  obtained  for  the  functions 
2 n 


z r n n 
e , I q z , 


+ l 


1 + q 


+ If 


(b+n+1) 


and  fl  (l-q11  z)  . 
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More  recently,  convergence  results  for  the  Pade  approxlmants 

of  other  functions  including  certain  ratios  of  trigonometric 

polynomials  [51],  functions  of  the  form  I a / (1-3  z) 

n n 

[60,  61,  69],  quasi-meromorphic  functions  (where  a finite 

number  of  essential  isolated  singular  points  are  allowed) 

foz 

[50],  functions  of  the  form  a e II  (1  - a z)/(l  - 6.  z) 

[5],  and  ratios  of  certain  special  functions  (cf.  [105,  106, 

107])  have  been  obtained.  In  particular,  Pade  approximants 

2 

of  the  important  function  e have  been  studied  in  detail 

by  Saff  and  Varga  [139,  140]  and  Saff,  Varga  and  Ni  [142].  Of 

special  interest  is  that  the  optimal  sequence  of  approximants 

— z 

from  the  Pade  table  of  e which  minimizes  the  error  on 
[0,°°)  is  the  sequence  {[n/3n]}. 


4 . Best  local  approximation. 

In  1934,  Walsh  [160]  noted  that  the  Taylor  polynomial 
^k=0akz  » t^ie  [n/  0]  Pade  approximant , of  a 

function  holomorphic  at  the  origin  could  be  obtained  by 
taking  the  limit,  as  e -*■  0+  , of  the  net  of  n^  degree 
polynomials  which  best  approximate  f in  the  closed  discs 
under  the  uniform  norm.  Later  [161,  162],  he  proved 
the  following  result  for  other  Pad4  approximants. 

,m+n+l , 


THEOREM  4.1  Let  f be  in  C 
write 


[0,6]  for  some  6 > 0 and 


(4.1)  f(x)  = a + a x + ...  + a xm+n  + 0(xm+n+^) 
0 1 m+n 


r = r (f)  denote  a rational  function  of 

E E ' 


as  x -*•  0 . Let 
degree  (m,n)  which  best  approximates  f 


in  the  uniform 


norm  on  the  closed  interval  [0,  e],  0 < e < <5  . Suppose 

that  the  determinant 
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a 

m 

am-l 

m-n+1 

3m+l 

a 

m 

am-n+2 

1 

m+n-1 

am+n-2 

. . . a 

m 

where  a . =0  i_f  j < 0 . Then  as  g -*■  0+ 
converges  to  the  [m/n]  Padd  approxlmant 
on  any  closed  interval  [0,  e^]  , 0 < < 

[m/n]  = [m/n](f)  Is  analytic. 


/ 0 and  a ^ 0 

, the  net  {r  } 
e 

of  f , uniformly 
<5  , on  which 


The  normality  condition  (4.2)  of  f is  needed  in 


Walsh's  proof  to  guarantee 


r - f 

G 


' ' [0,G] 
[m/n]  - f 


= 0 (em+n+1) 


which  follows  from  the  fact  that  f 

) (cf.  section  2).  In  [34],  Chui,  Shisha  and  Smith 
used  an  approximation  theoretic  proof  to  obtain  the  following 
result,  with  the  normality  condition  dropped. 


THEOREM  4.2.  Let  f be  a real-valued  function  in  Cm+n+'*' [ 0 , 6 ] 

for  some  5 > 0 , and  for  each  g , 0 < e < <S  , 

let  r^  be  the  rational  function  of  degree  (m,n)  which 

best  approximates  f on  [0,e]  in  the  uniform  norm.  Then 

the  net  { r } converges  uniformly  to  the  [m/n]  Pade 

approximant  of  f on  some  closed  interval  [0,e_], 

+ 0 

0<GQ<6,as  g->0 


The  above  mentioned  results  lead  to  the  concept  of 
best  local  approximation  which  we  describe  as  follows: 

Let  M be  a class  of  functions  in  C[0,  6]  , 6 > 0 . 
Suppose  that  for  each  e > 0 with  0 < e < 6 , a function 
f g C [0,  <5  ] has  a best  uniform  approximant  Pc(f)  on 
[0,  g]  from  M ; that  is 

llp£(t)  - f|l[0,c]  * lnf{|lp  - f|l[0,e] 
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If  as  e -*  0+  , the  net  {p  (f)}  converges  uniformly  on 

some  closed  interval  [0,£.q]  , 0 < e^  ' 6 , to  some  function 

p^f)  e M , we  say  that  p^(f)  is  a best  local  approximant 

of  f from  M.  Hence,  if  M is  the  class  of  all  rational 

functions  of  degree  (m,n)  , the  above  theorem  says  that 

for  every  (real-valued)  function  f e C,+n  f 0 , o ] , 6 > 0, 

the  [m/n]  Pade  approximant  of  f is  the  unique  best  local 

approximant  of  f from  M.  For  u ,...,un  e C [0,6], 

we  denote  by  S = S (u,,...,u  ) the  algebraic  span  of 
n n 1 n 

{u, ,...,u  }.  In  [35],  Chui,  Shisha  and  Smith  obtained  the 

1 n 

following  result. 


THEOREM  4.3.  Let  {u, ,...,u  } C Cn  [0,6]  for  some  positive 

— — ] n - — * 

6 , such  that  S is  a Haar  subspace.  For  each 
f e Cn  [0,e] , 0 < e < 6 , Jjrt  p (f)  denote  the  best 
uniform  approximant  of  f on  [0,e]  from  S . Then  the 
net  { p ( f ) } converges , as  e + 0+  , uniformly  on  some 

[0,e0J,  0 < e-  < 4 , to  some  function  p^(f  ) e S^  for 
each  f e Cn [ 0 , e ] if  and  only  if  the  Wronskian  matrix  at  0 


(4.3)  A 

n 


’u  (0) 

u2(0) 

..  u (0) 
n 

u]<n_1)(0) 

u^n_])(0) 

(n-1 

. . u 

n 

(0) 


is  nonsingular.  Furthermore , if  dot  i 0 , we  have 

(P0(f)  - f)(])(0)  =0  , .1  = 0,  . . . ,n-1  . 

In  [36],  Chui,  Smith  and  Ward  considered  the  problem 
of  best  local  approximation  using  the  L^,[0,e]  norm.  There, 
by  using  a different  method,  better  results  can  be  obtained 
under  some  weaker  continuity  conditions. 

Now  suppose  that  the  matrix  A^  in  (4.3)  is  singular; 
it  seems  to  be  an  interesting  problem  to  study  what  functions 
f c cn  [0,6]  have  the  property  that  the  nets  {p( (f)}  of 


97 


CHARLES  K.CHUI 


best  approximants  would  converge  as  e •>  0 . To  study  this 

problem,  the  notion  of  Taylor  rank  was  used  in  [35].  Let 
u , ...,un  e C [0,  d]  be  sufficiently  smooth.  The  smallest 
integer  N such  that  the  N x n matrix 


(4.4) 


UN  = 


Uj^  (0) 

u2(0) 

Un  (0) 

u(N-l) (o) 

. . . 

u(N'1)(0) 

n 

has  rank  n is  called  the  Taylor  rank  of  the  system 


{V“*’Un> 


(We  require  at  least 


u 


1’ 


,u  g CN  [0,  6] 
n 


and  remark  that  N > n) . Hence,  if  det  A / 0 , then 

- n 

N = n and  IT  = A . Let  ft..  be  the  image  of  R , the 
N n N 


real  Euclidean  n-space,  under  the  transformation  U 
following  result  was  obtained  in  [35]. 

THEOREM  4.4.  Let  N be  the  Taylor  rank  of 
.i  ^ r ro 

J1 


N 


The 


{u1,...,u  } C C [0,  6]  and  let  f e [0,  6]  where 


5 > 0.  Suppose  that  the  N-vector 

f E (f(0) 

lies  in  ft., 

N 

converges  to  some  Pg(f)  e 


approximants  of  f 
as  e -*■  0+  . Furthermore, 


, ...  , f(N  1} (0)  ) 

Then  the  net  { p^ ( f ) } of  best  uniform 
from  S 


(P0(f)  - f) 


(j) 


(0)  = 0 for  j = 0,...,N-1  . 


We  remark  that  the  Haar  condition  is  not  required  in  the 
above  theorem.  Next,  since  Pade  approximants  are  defined  by 
rational  functions  p /q  with  (f  q - p ) (x)  = 0(xm+n+^')  , 
it  seems  natural  to  consider  approximation  pairs  (p,  q) 
such  that  fq  - p is  "smallest".  More  precisely,  let  P 
and  Q be  finite  dimensional  subspaces  of  C[0,6]  , 

6 

where 


0.  Assume  that  dim  P = m , dim  Q = n and  dim  Q = n - 1 
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Q0  = {q  e Q : q(0)  = 0)  • 

Let  = Qg  + 1 , and  for  each  f c C [0,e]  , 0 < e < 6 , 
consider  the  minimization  problem 

(4.5)  inf  (||fq  ~ Pll[0  e]  : P E p • q £ • 

The  minimizing  pairs  (p^  , q^)  can  easily  be  shown  to 
exist.  If  the  subspace 

Rf  E P + fQ0 


is  Haar  on  [0,Eq]  , then  for  each  e , 0 < e < , 

the  minimizing  pair  (p^,  q£)  for  ^ is  also  unique.  Let 

} and  { q , . . . , q } be  bases  of  P and  Q_ 

1 m 1 n 1 ^ | ^ ^ 

respectively  and  assume  that  P,  Qq C C [0,6]  , 6 > 0 . 

For  each  f , define 

Ip  if  1 < i < m 

1 

fq.  if  m + l<i<m  + n-  l 

l-m 


where  m > 1 and  n > 2 . As  a consequence  of  Theorem  4.3, 
we  have  the  following  result  (cf.  [35]). 

THEOREM  4.5.  Let  f e Cm+n-1[0,6]  , where  6 > 0 , be 
such  that  the  Wronskian  matrix  at  0 


(4.6)  B , .(f)  = 

m+n-1 


(0) 


(m+n-2) 


m+n-1 


(0) 


(0)  .. 


(m+n-2) 

m+n-1 


(0) 


is  nonsingular.  For  each  e>0,0<e<6,  let 
(p^,  q^)  be  the  (unique)  minimizing  pair  for  f from 

P x Q as  described  above.  Then  the  net  { (p  , q )} 

1 + £ e 

converges,  as  e 0 , to  a pair  (p^,  q^)  t P x . 

Furthermo re , 
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(f  - ~ ) (x)  = 0(xm+n  1) 

qo 

+ 

as  x -*  0 

The  uniqueness  of  the  minimizing  pairs  follows,  since 
det  Bm+n  ^(f)  ^ 0 implies  that  the  space  is  Haar  in 

[0,  eQ]  for  some  eQ  > 0 . In  [35],  (P(),  qQ)  = (pQ(f ) ,qQ(f ) ) 
is  called  the  best  local  quasi- rational  approximant 
of  f from  P x Q^.  As  an  example,  if  we  take  P and  Q 
to  be  the  subspaces  of  polynomials  of  degree  m and  n 
respectively,  the  above  theorem  gives  the  following  result 
which  relates  to  [m/n]  Pade  approximants . 

COROLLARY  4.1.  Let_  f be  in  Cm+n+1[0,6]  for  some 
6 > 0 , and  write  f as  in  (4.1).  Suppose  that  the 
normality  condition  (4.2)  of_  f is  satisfied.  Then  there 
Is  an  eQ  ’ 0 < eQ  - ^ ’ suc*1  that  for  each  e , 

0 < e < £q  , there  exists  a unique  pair  of  polynomials 
(P£,  1£)  > with  degree  < m , degree  q^  < n and 
dc(0)  = 1 i such  that  (p  , q„)  solves  the  minimization 

problem  (4.5),  and  such  that  the  net  { (p  , q )}  converges, 

+ e 
as  e -*■  0 , uniformly  on  some  [0,  e^]  , > 0 , to  a 

pair  of  polynomials  (pQ,  qQ)  , with  degree  pQ  < m , 

degree  ^ < n and  q^O)  = !•  Furthermore,  the  rational 

function  Pq/'Iq  is  the  [m/n]  Pad6  approximant  of  f. 

For  more  information  on  the  problem  of  best  local 
approximation  an  interested  reader  should  consult  [35,  36]. 


100 


RECENT  RESULTS  ON  PADE  APPROXIMANTS 


References 


1 Allen,  G.D.,  C.K.  Chui,  W.R.  Madych , F.J.  Narcowich  and 

P.W.  Smith,  Pade  approximation  and  orthogonal  poly- 
nomials, Bull.  Aust.  Math.  See.,  U1  (1974),  263-271. 

2 Allen,  G.D.,  C.K.  Chui,  W.R.  Madych,  F.J.  Narcowich  and 

P.W.  Smith,  Padd  approximation  and  Gaussian  quadrature 
Bull.  Aust.  Math.  Soc . , 1J.  (1974),  63-71. 

3 Allen,  G.D.,  C.K.  Chui,  W.R.  Madych,  F.J.  Narcowich  and 

P.W.  Smith,  Padd  approximation  of  Stieltjes  series, 

J.  Approx.  Theory,  14  (1975),  302-316. 

4 Allen,  G.D.  and  F.J.  Narcowich,  On  the  representation 

and  approximation  of  a class  of  operator-valued 
analytic  functions.  Bull.  Amer.  Math.  Soc.  <31  (1975), 
410-413. 

5 Arms,  R.J.  and  A.  Edrei,  The  Padd  tables  and  continued 

fractions  generated  by  totally  positive  sequences. 
Math.  Essays  dedicated  to  A.J.  Macintyre,  Ohio  Univ. 
Press,  Athens  (Ohio),  1970. 

6 Baker,  G.A.  Jr.,  Advances  in  Theoretical  Physics,  K.A. 

Brueckner,  ed.,  Vol.  1,  Academic  Press,  New  York,  1965 

7 Baker,  G.A.  Jr.,  Essentials  of  Padd  Approximants , 

Academic  Press,  London-New  York,  1975. 

8 Baker,  G.A.  Jr.,  Convergence  of  Pade  approximants  using 

the  solution  of  linear  functional  equations,  J.  Math. 
Phys. , 16  (1975)  , 813-822. 

9 Baker,  G.A.  Jr.,  A theorem  on  the  convergence  of  Padd 

approximants,  Studies  in  Applied  Math.  (To  appear). 

10  Baker,  G.A.  Jr.  and  J.L.  Gammel,  ed.,  The  Pade  Approxi- 

mant  in  Theoretical  Physics,  Academic  Press,  New  York- 
London,  1970. 

11  Baker,  G.A.  Jr.  and  P.R.  Graves-Morris , Convergence  of 

rows  of  the  Pade  table  J.  Math.  Analysis  and  Appls. 
(To  appear) . 

12  Balk,  M.B.  The  Padd  interpolation  process  for  certain 

analytic  functions,  Issledovaniya  po  souremennym 
problemam  teorii  funckcii  kompleksnogo  peremennogo- 
Gosudarstr.  Izdat.  Fiz.-Mat.  Lit.,  Moscow,  1960. 


101 


CHARLES  K.  CHUI 


13  Barnsley,  M. , The  bounding  properties  of  the  multipoint 

Padd  approximant  to  a series  of  Stieltjes  on  the  real 
line,  J.  Math.  Phys . , 14  (1973),  299-313. 

14  Basdevant,  J.L.,  D.  Bessis,  and  J.  Zinn-Justin,  Padd 

approximants  in  strong  interactions.  Two-body  pion 
and  kaon  systems,  II  Nuovo  Cimento,  Series  X,  60  A 
(1969),  185-238. 

15  Beardon,  A.F.,  The  convergence  of  Padd  approximants, 

J.  Math.  Analysis  and  Applications,  2/L  (1968),  344- 
346. 

16  Bessis,  D. , Topics  in  the  theory  of  Padd  approximants, 

in  Padd  Approximants,  Graves-Morris , ed.,  The  Institute 
of  Physics,  London,  1973. 

17  Bessis,  D.,  Padd  approximants  in  Quantum  Field  Theory, 

in  Padd  Approx . and  Their  Appl . , Graves-Morris, 
ed.,  Academic  Press,  New  York,  1973. 

18  Bessis,  D. , Construction  of  variational  bounds  for  the 

n-body  eigenstate  problem  by  the  method  of  Padd 
approximations,  in  Applications  of  the  Padd  Method 
in  Mechanics,  Springer  Verlag  Lecture  Notes  in 
Physics,  (To  appear). 

19  Bessis,  J.D.  and  J.D.  Talman,  Variational  approach  to 

the  theory  of  operator  Padd  approximants.  Rocky 
Mountain  J.  Math.,  4_  (1974),  151-158. 

20  Bessis,  D.  and  M.  Villani,  Padd  approximation  in  non- 

relativistic  and  relativistic  quantum  mechanics, 
International  Conference  on  Mathematical  Problems  of 
Quantum  Field  Theory  and  Quantum  Statistics,  Moscow, 
1972. 

21  Bove,  A.,  G.  Fano,  A.G.  Teolis  and  G.  Turchetti, 

Functional  integration,  Padd  approximants,  and  the 
Schrodinger  equation,  J.  Math.  Phys.,  16^  (1975), 
268-270. 

22  Brezinski,  C.,  Forme  confluente  de  1' e-algorithme 

topologique,  Numer.  Math.  2J3  (1975),  363-370. 

23  Brezinski,  C.,  Analyse  numerique. — Gdndration  de  suites 

totalement  monotones  et  oscillantes,  C.  R.  Acad. 

Sc.  Paris,  t.  280,  Series  A,  (1975),  729-730. 

24  Brezinski,  C.,  Rhombus  algorithms  connected  to  the 

Padd  table  and  continued  fractions,  (To  appear). 


102 


RECENT  RESULTS  ON  PADE  APPROXIMANTS 

25  Brezinski,  C.,  Computation  of  Pade  approximants  and  con- 

tinued fractions,  (In  manuscript). 

26  Bruin,  M.G.  de,  Generalized  C-fractions  and  a multi- 

dimensional Pade  table,  Ph.D.  Thesis,  Amsterdam, 

1974 . 

27  Bruin,  M.G.  de.  Three  classes  of  Pade  tables  whose 

upper  halves  are  normal,  (In  manuscript). 

28  Bruin,  M.G.  de,  Convergence  in  the  Padd  table  of 

^F^(l;c;x)  with  c i Z\N,  (In  manuscript). 

29  Bruin,  M.G.  de  and  H.  van  Rossum,  Formal  Pade  approxi- 

mation, Nieuw  Arch,  voor  Wiskunde  (3),  2J1  (1975), 
115-130. 

30  Carleman,  T.,  Les  fonctions  quasi-analytiques , Gauthier- 

Villars,  Paris,  1926. 

31  Chisholm,  J.S.R.,  Rational  approximants  defined  from 

double  power  series,  Math.  Comp.,  2_7  (1973),  841-848. 

32  Chisholm,  J.S.R.  and  P.R.  Graves-Morris , Generalizations 

of  the  theorem  of  de  Montessus  to  two  variable 
approximants,  (To  appear). 

33  Chisholm,  J.S.R.  and  J.  McEwan,  Rational  approximants 

defined  from  power  series  in  N variables,  Proc.  R. 

Soc.  Lond.  A,  (1974),  421-452. 

34  Chui,  C.K.,  0.  Shisha  and  P.W.  Smith,  Pade  approximants 

as  limits  of  chebyshev  rational  approximants,  J. 

Approx.  Theory,  12_  (1974),  201-204. 

35  Chui,  C.K.,  0.  Shisha  and  P.W.  Smith,  Best  local 

approximation,  J.  Approx.  Theory,  (To  appear). 

36  Chui,  C.K.,  P.W.  Smith  and  J.D.  Ward,  Best  I^  local 

approximation,  (To  appear). 

37  Claessens,  G.,  A new  look  at  the  Pad£  table  and  the 

different  methods  for  computing  its  elements,  J. 

Comp,  and  Appl.  Math.,  1 (1975),  141-152. 

38  Claessens,  G. , The  rational  Hermite  interpolation 

problem  and  some  related  recurrence  formulas,  (In 
manuscript) . 

39  Claessens,  G. , A new  algorithm  for  osculatory  rational 

interpolation,  (In  manuscript). 

40  Clenshaw,  C.W.  and  K.  Lord,  Rational  approximations 

from  Chebyshev  series,  Lanczos  Festschrift,  (To  appear). 


103 


CHARLES  K.  CHUI 


41  Cody,  W.J.,  A survey  of  practical  rational  and  poly- 

nomial approximation  of  functions.  Studies  in  Applied 
Mathematics,  6^  (1970),  86-109. 

42  Common,  A.K.  and  P.R.  Graves-Morris,  Some  properties  of 

Chisholm  approximants , J.  Inst.  Maths.  Applies., 

13  (1974),  229-232. 

43  Copley,  L.A.  and  D.  Masson,  Pade-approximant  calculation 

of  it -it  scattering,  Phys.  Review,  164  (1967),  2059- 
2062. 

44  Corbella,  O.D.,  C.R.  Garibotti,  and  F.F.  Grinstein, 

Rational  polynomial  approximants  and  scattering 
amplitude  for  long  range  potentials,  (In  manuscript). 

45  Critchfield,  C.L.  and  J.L.  Gammel,  Rational  approximants 

for  inverse  functions  of  two  variables,  Rocky 
Mountain  J.  Math.,  (1974),  339-349. 

46  Dahlquist,  G. , S.C.  Eisenstat  and  G.H.  Goloub,  Bounds 

for  the  error  of  linear  systems  of  equations  using 
the  theory  of  moments,  J.  Math.  Anal,  and  Appl.,  _34 
(1972),  151-166. 

47  Donnelly,  J.D.P.,  The  Fade  table,  in  Methods  of 

Numerical  Approximation,  D.C.  Handscomb,  ed.,  Pergamon 
Press,  Oxford,  1966. 

48  Ellis,  H.G.,  Continued  fraction  solutions  of  the  general 

Riccati  differential  equation.  Rocky  Mountain  J. 

Math.,  4 (1974),  353-356. 

49  Edrei,  A.  The  Pade  table  of  meromorphic  functions  of 

small  order  with  negative  zeros  and  positive  poles. 
Rocky  Mountain  J.  Math.  4^  (1974),  175-180. 

50  Edrei,  A.,  The  Pad6  table  of  functions  having  a finite 

number  of  essential  singularities,  Pac.  J.  Math., 

56  (1975),  429-453. 

51  Edrei,  A.,  Convergence  of  the  complete  Pade  tables  of 

trigonometric  functions,  J.  Approx.  Th. , V5  (1975), 

(To  appear) . 

52  Edrei, A. , The  complete  Pade  tables  of  certain  series  of 

simple  fractions,  Rocky  Mountain  J.  Math.,  (To  appear). 

53  Fair,  W. , Pad6  approximation  to  the  solution  of  the 

Ricatti  equation.  Math,  of  Comp.,  _18  (1964),  627-634. 

54  Fair,  W. , and  Y.L.  Luke,  Pad£  approximations  to  the 

operator  exponential,  Numer.  Math.,  _14  (1970), 

379-382. 


104 


RECENT  RESULTS  ON  PADE  APPROXIMANTS 


55 

56 

57 

58 

59 

60 
61 

62 

63 

64 

65 

66 
67 


Field,  D.A.,  Series  of  Stieltjes,  Pade  approximants  and 
continued  fractions,  (To  appear). 

Fisher,  M.E.,  Critical  point  phenomena — The  role  of 
series  expansions.  Rocky  Mountain  J.  Math.,  4 (1974), 
181-201. 

Fisher,  M.E.  and  H.B.  Tarko,  Tests  of  strong  scaling  in 
the  three-dimensional  Ising  model.  Physical  Review 
B,  U (1975),  1131-1133. 

Fleischer,  J. , J.L.  Gammel,  and  M.T.  Menzel,  Matrix 
Pade  approximants  for  the  and  -^P^  partial  waves 

in  nucleon-nucleon  scattering,  Physical  Review  D,  8 
(1973),  1546-1552. 

Fleischer,  J.  and  J.A.  Tjon,  Bethe-Salpeter  equation  for 
J=0  nucleon-nucleon  scattering  with  one-boson 
exchange.  Nuclear  Physics,  B 8h_  (1975),  375-396. 

Franzen, N. R. , Some  convergence  results  for  Pade 

approximants,  J.  Approx.  Theory,  6^  (1972),  254-263. 

Franzen,  N.R.,  Convergence  of  Pade  approximants  for  a 
certain  class  of  meromorphic  functions,  J.  Approx. 
Theory,  6 (1972),  264-271. 

Freud,  G. , Pade  approximation  and  Gaussian  quadrature, 
Joint  University  of  Texas — Texas  A&M  University 
Seminar  on  Approximation  Theory,  1975. 

Frobenius,  G. , Uber  Relationen  zwischen  den  Naherungs- 
briichen  von  Potenzreihen,  J.  fur  reine  und  angew. 

Math. , 90  (1881) , 1-17. 

Gallucci,  M.A.  and  W.B.  Jones,  Rational  approximations 
corresponding  to  Newton  series  (Newton-Pad6  approxi- 
mants), J.  Approx.  Theory,  (To  appear). 

Gammel,  J.L.,  Continuation  of  functions  beyond  natural 
boundaries,  Rocky  Mountain  J.  Math.,  4 (1974),  203- 
206. 

Gammel,  J.L.,  Effect  of  random  errors  (noise)  in  the 
terms  of  a power  series  on  the  convergence  of  the 
Pad6  approximants,  (To  appear). 

Gammel,  J.L.,  Review  of  two  recent  generalizations  of 
the  Pad6  approximant , Padg  Approximants  and  Their 
Applications , edited  by  P.R.  Graves-Morris , Academic 
Press,  1973. 


105 


CHARLES  K.CHUI 


68  Gammel,  J.L.  and  M.T.  Menzel,  Bethe-Salpeter  equation 

for  nucleon-nucleon  scattering  matrix  Padd  approximants. 
Physical  Review  D,  1_  (1973),  565-568;  II.  Physical 
Review  D,  11  (1975),  963-966. 

69  Gammel,  J.L.  and  J.  Nuttall,  Convergence  of  Padd  approxi- 

mants to  quasianalytic  functions  beyond  natural 
boundaries,  J.  Math.  Analysis  and  Appl.,  4J3  (1973), 
694-696. 

70  Gammel,  J.L.,  C.C.  Rousseau,  and  D.P.  Saylor,  A generali- 

zation of  the  Padd  approximant , J.  Math.  Anal,  and 
Appl.,  20  (1967),  416-420. 

71  Garibotti,  C.R.  and  F.F.  Grinstein,  A summation  procedure 

for  expansions  in  orthogonal  polynomials,  (In 
manuscript) . 

72  Garibotti,  C.R.  and  F.F.  Grinstein,  Summation  of  the 

Bom  series  in  potential  scattering,  (In  manuscript). 

73  Genz,  A.,  The  e-algorithm  and  some  other  applications 

of  Padd  approximants  in  numerical  analysis,  in  Padd 
Approximants  and  Their  Applications,  P.R.  Graves- 
Morris,  ed..  Academic  Press,  London-New  York,  1973. 

74  Gilewicz,  J.,  Method  of  the  determination  of  the 

conformal  mapping  by  the  accelerated  iterations  with 
a Pade  approximant,  (unpublished  manuscript). 

75  Gilewicz,  J. , Totally  monotonic  and  totally  positive 

sequences  for  the  Padd  approximants  method,  CPT-CNRS 
Marseille,  (To  appear). 

76  Gilewicz,  J.,  Meilleur  approximant  de  Pade  en  analyse 

numdrique,  (To  appear). 

77  Gilewicz,  J.,  Numerical  detection  of  the  best  Padd 

approximant  and  determination  of  the  Fourier 
coefficients  of  the  insufficiently  sampled  functions, 
in  Pade  Approx,  and  Their  Appl.,  Graves-Morris , ed., 
Academic  Press,  New  York,  1973. 

78  Golden,  J.E.,  J.H.  McGuire,  and  J.  Nuttall,  Calculating 

Bessel  functions  with  Pade  approximants,  J.  Math. 

Anal,  and  Appl.,  43.  (1973),  754-767. 

79  Goncar,  G.A.,  (Unpublished  manuscript). 

80  Gragg,  W.B.,  Truncation  error  bounds  for  n-f ractions , 

Bull.  Amer.  Math.  Soc.,  J76  (1970),  1091-1094. 


106 


RECENT  RESULTS  ON  FADE  APPROXIMANTS 


81  Gragg,  W.B.,  The  Pade  table  and  its  relation  to  certain 

algorithms  of  numerical  analysis,  SIAM  Review,  14 
(1972),  1-62. 

82  Gragg,  W.B.,  Matrix  interpretations  and  applications  of 

the  continued  fraction  algorithm.  Rocky  Mountain  J. 
Math.,  4 (1974),  213-225. 

83  Gragg,  W.B.  and  G.  D.  Johnson,  The  Laurent-Pade  table. 

Information  Process,  74_  (1974),  632-637. 

84  Graves-Morris,  P.R.  (ed.),  Pade  Approximants  and  Their 

Applications,  Proc.  Conference  at  University  of  Kent 
1972,  Academic  Press,  London-New  York,  1973. 

85  Graves-Morris,  P.R.,  R.  Hughes  Jones  and  G.J.  Makinson, 

The  calculation  of  some  rational  approximants  in  two 
variables,  J.  Inst.  Maths.  Applies,  13  (1974),  311-320. 

86  Graves-Morris,  P.R.  and  D.E.  Roberts,  Calculation  of 

Canterbury  approximants,  (In  manuscript). 

87  Heine,  E. , Handbuch  der  Kugelfunctionen:  Erster  Band, 

Theorie  der  Kugelfunctionen  und  der  verwandten 
Functionen;  Zweiter  Band,  Anwendunger  der  Kugel- 
functionen und  der  verwandten  Functionen,  G.  Reimer, 
Berlin,  1878,  1881. 

88  Henrici,  Peter  and  Pia  Pfluger,  Truncation  error 

estimates  for  Stieltjes  fractions,  Num.  Mathematik, 

9 (1966),  120-138. 

89  Holdaman,  J.T.  Jr.,  A method  for  the  approximation  of 

functions  defined  by  formal  series  expansions  in 
orthogonal  polynomials.  Math.  Comp.  ^3  (1969), 

275-288. 

90  Jager,  H.  , A multidimensional  generalization  of  the 

Pade  table,  Ph.D.  Thesis,  Amsterdam,  1964. 

91  Jones,  R.  H.,  General  rational  approximants  in  n 

variables,  (In  manuscript). 

92  Jones,  W.B.  and  W.J.  Thron,  eds..  Proceedings  of  the 

International  Conference  on  Pad6  Approximants, 

Continued  Fractions  and  Related  Topics,  Rocky 
Mountain  J.  Math.,  4^  (1974),  no.  2. 

93  Jones,  W.B.  and  W.J.  Thron,  On  convergence  of  Pade 

approximants  SIAM  J.  Math.  Analysis,  6^  (1975),  9-16. 


107 


CHARLES  K.  CHUI 


94  Karlsson,  J.,  Rational  interpolation  with  free  poles  in 

the  complex  plane.  Preprint  University  of  Umea, 
Department  of  Mathematics,  no.  6,  1972. 

95  Karlsson,  J.,  Rational  interpolation  and  best  rational 

approximation,  Preprint  University  of  Umea, 

Department  of  Mathematics,  no.  1,  1974. 

96  Karlsson,  J.,  Rational  Hermite  interpolation  procedures. 

Preprint  University  of  Umea,  Department  of  Mathematics, 
no.  2,  1974. 

97  Karlsson,  J.  and  B.  von  Sydow,  The  convergence  of  Pade 

approximants  to  series  of  Stieltjes,  (In  manuscript). 

98  Knowles,  J.B.  and  D.W.  Leggett,  An  analysis  of  trans- 

port delay  simulation  methods.  Radio  and  Electronic 
Engineer,  42  (1972),  172-178. 

99  Longman,  I.M.,  Computation  of  the  Pade  table,  Intern. 

J.  Computer  Math.  Section  B,  3 (1971),  53-64. 

100  Longman,  I.M.,  On  the  generation  of  rational  function 

approximations  for  Laplace  transform  inversion  with 
an  application  to  viscoelasticity,  SIAM  J.  Appl. 

Math.,  24  (1973),  429-440. 

101  Longman,  I.M.,  Best  rational  function  approximation 

for  Laplace  transform  inversion,  SIAM  J.  Math. 

Anal.,  5 (1974),  574-580. 

_ 

102  Lopez,  C.  and  F.J.  Yndurain,  Model  independent  calcula- 

tion of  Kp  dispersion  relations:  Extrapolation  with 

Pade  techniques,  Nuclear  Physics,  B 64_  (1973), 

315-333. 

103  Lopez,  C.  and  F.J.  Yndurain,  The  moment  problem  and 

stable  extrapolations  with  an  application  to  forward 
Kp  dispersion  relations  in  Pade  Approximants  and 

Their  Appl . , Graves-Morris , ed..  Academic  Press, 

New  York,  1974. 

104  Luke,  Y.L.,  The  Pade  table  and  the  t-method,  J. 

Math,  and  Phys. , 37  (1958),  110-127. 

105  Luke,  Y.L.,  The  Special  Functions  and  Their  Applica- 

tions, vols.  1 and  2,  Academic  Press,  New  York, 

1969. 

106  Luke,  Y.L.,  Mathematical  Functions  and  Their  Approxima- 

tions, Academic  Press,  New  York,  1975. 


108 


RECENT  RESULTS  ON  PADE  APPROXIMANTS 

107  Luke,  Y.L.,  On  the  error  in  the  Pade  approximants  for 

a form  of  the  incomplete  Gamma  function  including 
the  exponential  function,  SIAM  J.  Math.  Anal.  (1975), 
829-839. 

108  Luke,  Y.L.,  W.  Fair  and  J.  Wimp,  Predictor-Corrector 

formulas  based  on  rational  interpolants , Comp.  & 
Maths,  with  Appls.  , 1^  (1975),  3-12  . 

109  Maehly,  H. , First  Interim  Progress  Report  on  Rational 

Approximations  Project  NR.  044-196,  Princeton 
University,  June  23,  1958. 

110  Magnus,  A.,  Certain  continued  fractions  associated  with 

the  Padd  table.  Math.  Zeitscnr. , (1962),  361-374. 

111  Magnus,  A.,  P-fractions  and  the  Padd  table.  Rocky 

Mountain  J.  Math.,  4^  (1974),  257-259. 

112  Magnus,  A.  and  J.  Wynn,  On  the  Padd  table  of  cos  z, 

Proc.  Amer.  Math.  Soc.,  47  (1975),  361-367. 

113  Markoff,  A. A.  Dif ferenzenrechnung,  B.G.  Teubner, 

Leipzig,  1896. 

114  Masson,  D. , Padd  approximant  and  partial-wave  integral 

equation,  J.  Math.  Phys.,  8 (1967),  512-514. 

115  Masson,  D. , Hilbert  space  and  the  Padd  approximant, 

in  The  Padd  Approximant  in  Theoretical  Physics, 

Baker  Jr.  and  Gammel,  eds. , Academic  Press,  New 
York,  1970. 

116  Masson,  D. , Pade  approximants  and  Hilbert  spaces,  in 

Pade  Approximants  and  Their  Applications,  Graves- 
Morris,  ed..  Academic  Press.  New  York,  1973. 

117  McGuire,  J.H.,  J.E.  Golden  and  J Nuttall,  Solution 

of  the  schrodinger  equation  using  Pade  approximants 
to  sum  asymptotic  series,  Rocky  Mountain  J.  Math., 

4 (1974),  375-376. 

118  Milosevid,  S.  and  H.F..  Stanley,  Calculation  of  the 

equation  of  state  near  the  critical  point  for  the 
Heisenberg  model  using  Pade  approximants,  in  Pade 
Approx,  and  Their  Appl . , Graves-Morris , ed  . , 

Academic  Press,  New  York,  1973. 

119  Montessus  de  Ballore,  R.  de,  Sur  les  fractions  con- 

tinues algebriques.  Bull.  Soc.  Math.  France,  30 
(1902),  28-36. 


109 


CHARLES  K.CHUI 


120  Narcowich,  F.J.  and  G.D.  Allen,  R-Operators  I. 

Representation  theory  and  applications,  Indiana  J. 
Math. , (To  appear) . 

121  Nuttall,  J. , The  convergence  of  Pade  approximants  of 

meromorphic  functions,  J.  Math.  Analysis  and  Appl. , 

31  (1970),  147-153. 

122  Nuttall,  J.,  The  connection  of  Pade  approximants  with 

stationary  variational  principles  and  the  convergence 
of  certain  Pade  approximants,  in  The  Pade  Approximants 
in  Theoretical  Physics,  G.A.  Baker  Jr.,  ed. , 

Academic  Press,  New  York,  1970. 

123  Nuttall,  J. , Variational  principles  and  Pade  approxi- 

mants, in  Pade  Approximants  and  Their  Applications. 
Graves- Morris , ed. , Academic  Press,  New  York,  1973. 

124  Nuttall,  J.,  The  convergence  of  certain  Pade  approxi- 

mants, Rocky  Mountain  J.  Math.,  4^  (1974),  269-272. 

125  Nuttall,  J. , The  convergence  of  Pade  approximants  for 

a class  of  functions  with  branch  points,  (In  manu- 
script) . 

126  Nuttall,  J.,  Orthogonal  polynomials  for  complex  weight 

functions  and  the  convergence  of  related  Pade 
approximants,  (In  manuscript). 

127  Pade,  H. , Sur  la  representation  approch£e  d'une 

fonction  par  des  fractions  rationelles,  Ann.  Sci. , 
Ecole  Norm.  Sup.  (3),  9^  (1892),  Supplement  1-93. 

128  Fade,  H. , Sur  les  series  entieres  convergentes  ou 

divergentes,  Acta  Math.  18  (1894),  97-111. 

129  Pade,  H. , Sur  la  generalisation  des  fractions  continues 

algebrique,  J.  de  Math.  Pures  et  Appl.,  10^  (1894), 
291-329. 

130  Pade,  H.,  Sur  la  distribution  des  rdduites  anormales 

d'une  fonction,  C.R.  Acad.  Sci.  Paris,  130  (1900), 
102-104. 

131  Perron,  0.,  Die  Lehre  von  den  Kettenbriichen,  Band  II, 

B.G.  Teubner,  Stuttgart,  1957. 

132  Pommerenke,  C. , Pade  approximants  and  convergence  in 

capacity,  J.  Math.  Analysis  and  Appl.,  41^  (1973), 
775-780. 

133  Roberts,  D. , H.P.  Griffiths  and  D.W.  Wood,  The 

analysis  of  double  power  series  using  Canterbury 
approximants,  J.  Phys.  A : Math.  Gen.,  8 (1975), 
1365-1372. 


110 


RECENT  RESULTS  ON  PADE  APPROXIMANTS 


134  Ronveaux,  A. , Padd  approximant  and  homographic  trans- 

formation of  Riccati  Phase’s  equations,  in  Pade 
Approx,  and  Their  Appl.,  Graves-Morris , ed., 

Academic  Press,  New  York,  1973. 

135  Ronveaux,  A.,  M.C.  Dumont-Lepage , and  J.  Habay, 

Counting  the  poles  of  the  solutions  of  a general 
Riccati  equation,  (In  manuscript). 

136  Rossom,  H.  van,  Systems  of  orthogonal  and  quasi 

orthogonal  polynomials  connected  with  the  Pade 
table,  I,  II,  III,  Nederl.  Akad.  Wetensch.  Proc., 

58  (1955),  517-525,  526-534,  676-682. 

137  Rossum,  H.  van,  On  the  poles  of  Pade  approximations 

2 

to  e , Nieuw  Archief  voor  Wiskunde,  19^  (1971), 

37-45. 

138  Saff,  E.B.,  An  extension  of  Montessus  de  Ballore's 

theorem  on  the  convergence  of  interpolating  rational 
functions,  J.  Approx.  Theory,  6 (1972),  63-67. 

139  Saff,  E.B.  and  R.S.  Varga,  Convergence  of  Pade 

— 2 

approximants  to  e on  unbounded  sets,  J.  Approx. 
Theory,  13  (1975),  470-488. 

140  Saff,  E.B.  and  R.S.  Varga,  On  the  zeros  and  poles  of 

Pade  approximants  to  ez,  Num.  Math.  (To  appear). 

141  Saff,  E.B.  and  R.S.  Varga,  On  the  sharpness  of  theorems 

concerning  zero-free  regions  for  certain  sequences 
of  polynomials,  Num.  Math.  (To  appear). 

142  Saff,  E.B.,  R.S.  Varga  and  W.-C.  Ni,  Geometric  con- 

— z 

vergence  of  rational  approximations  to  e in 
infinite  sectors,  Num.  Math.  (To  appear). 

143  Salzer,  H.E.,  Note  on  osculatory  rational  interpola- 

tion, Math.  Comp.,  16^  (1962),  486-491. 

144  Shafer,  R.E.,  On  quadratic  approximation,  SIAM  J. 

Numer.  Anal.,  LI  (1974),  447-460. 

145  Shamash,  Y. , Linear  system  reduction  using  Pade 

approximation  to  allow  retention  of  dominant  modes, 
Int.  J.  Control,  21  (1975),  257-272. 

146  Sobhy,  M.I.,  Applications  of  Pade  approximants  in 

electrical  network  problems,  in  Pade  Approx,  and 
Their  Appl. , Graves-Morris,  ed. , Academic  Press, 

New  York,  1973. 


Ill 


CHARLES  K.  CHUI 


147  Stahl,  H.,  Ph.D.  Thesis,  Technical  University  Berlin, 

(To  be  completed). 

148  Stieltjes,  T.-J.,  Recherches  sur  les  fractions  con- 

tinues, Ann.  Fac.  Sci.,  Toulouse  ^ (1894),  1-122  and 
9 (1895),  1-47. 

149  Szego,  G.,  Orthogonal  polynomials,  Amer.  Math.  Soc. 

Colloquium  Publications,  2J3,  Amer.  Math.  Soc.,  New 
York,  1939;  revised  ed.,  1959. 

150  Tarko,  H.B.  and  M.E.  Fisher,  Theory  of  critical  point 

scattering  and  correlations.  III.  The  Ising  model 
below  T and  in  a field.  Physical  Review  B,  11 

(1975),  1217-1253. 

151  T/jon,  J.A.,  Pade  approximants  in  three-body  calculations. 

Physical  Review  D,  1 (1970),  2109-2112. 

152  T)on,  J.A.,  Application  of  Pade  approximants  in  the 

three-body  problem,  in  Pade  Approx,  and  Their  Appl. , 
Graves-Morr is , ed..  Academic  Press,  New  York,  1973. 

153  Turchetti,  G.  , Matrix  Pade  approximants  in  ""W 

scattering,  Lettere  al  Nuovo  Cimento,  6 (1973), 

497-503. 

154  Underhill,  C.,  and  A.  Wragg,  Convergence  properties 

of  Pad£  approximants  to  exp(z)  and  their  derivatives, 

J.  Inst.  Math.  Appl.,  11  (1973),  361-367. 

155  Waadeland,  H. , T-fractions  from  a different  point  of 

view,  Rocky  Mountain  J.  Math.,  4.  (1974),  391-393. 

156  Waadeland,  H.,  Two  convergence  theorems  for  T-fractions, 

(Unfinished  manuscript). 

157  Wall,  H.S.,  The  Analytic  Theory  of  Continued  Fractions, 

D.  van  Nostrand,  Toronto-New  York-London,  1948. 

158  Wallin,  H. , On  the  convergence  theory  of  Pad£  approxi- 

mants, ISNM,  20  (1972),  461-469. 

159  Wallin,  H.,  The  convergence  of  Pade  approximants  and 

the  size  of  the  power  series  coefficients,  Applicable 
Analysis,  4 (1974),  235-251. 

160  Walsh,  J.L.,  On  approximation  to  an  analytic  function 

bv  rational  functions  of  best  approximation.  Math. 

Z. , 38  (1934),  163-176. 

161  Walsh,  J.L.,  Pade  approximants  as  limits  of  rational 

functions  of  best  approximation,  J.  Math.  Mech.  _1J3 
(1964),  305-312. 


112 


RECENT  RESULTS  ON  PADE  APPROXIMANTS 


162  Walsh,  J.L.,  Pade  approximants  as  limits  of  rational 

functions  of  best  approximation,  real  domain,  J. 
Approx.  Theory,  1J_  (1974),  225-230. 

163  Walsh,  J.L.,  The  role  of  the  pole  in  rational  approxima 

tion.  Rocky  Mountain  Math.  J.,  4^  (1974),  283-286. 

164  Warner,  D.D.,  Hermite  interpolation  with  rational 

functions,  Ph.D.  Thesis,  University  of  California 
at  San  Diego,  1974. 

165  Warner,  D.D.,  An  extension  of  Saff's  theorem  on  the  con 

vergence  of  interpolating  rational  functions,  (To 
appear) . 

166  WTieeler,  J.C.,  Modified  moments  and  Gaussian  quadratures 

Rocky  Mountain  J.  Math.,  4 (1974),  287-296. 

167  Wheeler,  J.C.,  M.G.  Prais,  and  C.  Blumstein,  Analysis 

of  spectral  densities  using  modified  moments. 

Physical  Review  B,  _10  (1974),  2429-2447. 

168  Wilson,  R. , Divergent  continued  fractions  and  non- 

polar singularities,  Proc.  London  Math.  Soc . , W 
(1930),  38-57. 

169  W'uytack,  L.,  The  use  of  Pade  approximation  in  numerical 

integration,  Proc.  Colloquium  Euromech  58,  on  the 
Method  of  Pad4  and  its  applications  to  Mechanics, 
Toulon,  1975. 

170  Wuytack,  L.,  On  the  osculatory  rational  interpolation 

problem.  Math.  Comp.,  !29  (1975),  837-843. 

171  Wuytack,  L. , Eigenschaften  eines  Algorithmus  zur 

rationalen  Interpolation,  ISNM  26  (1975),  193-199. 

172  Wuytack,  L. , Numerical  integration  by  using  nonlinear 

techniques,  J.  Comp,  and  Appl . Math.,  (To  appear). 

173  Wynn,  P.,  L'  e-algorithms  e la  tavola  di  Padd,  Rend. 

di  Mat.  Roma,  20  (1961),  403-408. 

174  Wynn,  P. , Upon  systems  of  recursions  which  obtain  among 

the  quotients  of  the  Pad6  table,  Num.  Math.,  8 (1966) 
264-269. 

175  Wynn,  P.,  Some  recent  developments  in  the  theories  of 

continued  fractions  and  the  Pade  table.  Rocky 
Mountain  J.  Math.,  4^  (1974),  297-324. 

176  Yndurain,  F.J.,  Stable  extrapolation  of  scattering 

amplitudes  using  unitarity,  Annals  of  Physics,  75^ 
(1975),  171-218. 


113 


CHARLES  K.CHUI 


177  YndurAin,  F.J.,  Convergence  of  regularized  Pad£ 

approximants  for  semi-infinite  range  Stieltjes 
function,  Presented  to  the  Euromech  Colloquium, 

Toulon  (France) . 

178  Zakian,  V.,  Properties  of  I approximants,  in  Pade 

MN  

Approx,  and  Their  Appl.,  Graves-Morris , ed.. 

Academic  Press,  New  York,  1973. 

179  Zakian,  V.,  Properties  of  I and  approximants 

and  applications  to  numerical  inversion  of  Laplace 
transforms  and  initial  value  problems,  J.  Math. 

Anal,  and  Appl.,  _50  (1975),  191-222. 

180  Zinn-Justin,  J.,  Convergence  of  Pad4  approximants  in 

the  general  case.  Rocky  Mountain  J.  Math.,  4_  (1974), 
325-330. 

Supplementary  references 

1 Atkinson,  F.V. , On  lacunary  and  other  orthogonal 

polynomials.  Math,  y Fisica  Teorica,  _10  (1954), 

97-110. 

2 Atkinson,  F.V.,  Orthogonal  polynomials  and  lacunary 

approximants,  in  Padfe  Approximants  and  Their  Appl., 
Graves-Morris,  ed. , Academic  Press,  London  and  New 
York,  1973. 

3 Chisholm,  J.S.R.  and  R.  H.  Jones,  Relative  scale  co- 

variance  of  N-variable  approximants,  Proc.  R.  Soc. 

Lond.  A.  344  (1975),  465-470. 

4 Engels,  J.  and  J.  Fleischer,  Legendre  Pade  approximants 

in  ffW  Scattering,  II  Nuovo  Cimento,  24^  (1974),  73-92. 

5 Field,  D.A. , Nested  sequences  of  lens-shaped  regions 

which  contained  Pade  and  continued  fraction  approxi- 
mations to  a series  of  Stieltjes,  (Submitted  for 
publication) . 

6 Fleischer,  J. , Analytic  continuation  of  scattering 

amplitudes  and  Pade  approximants,  Nuclear  Phy. 

B 37  (1972),  59-76. 

7 Fleischer,  J. , Generalizations  of  Pade  approximants, 

in  Pade  Approximants,  Graves-Morris,  ed.,  The  Institute 
of  Physics,  London,  1973. 


114 


RECENT  RESULTS  ON  PADE  APPROXIMANTS 


8 Fleischer,  J.,  Nonlinear  Pade  approximants  for  Legendre 

series,  J.  Math.  Phys.,  1^  (1973),  246-248;  and  in 
Pade  Approximants  and  Their  Appls.,  Graves-Morris , ed., 
Academic  Press,  London-New  York,  1973. 

9 Graves-Morris,  P.R.  and  R.H.  Jones,  An  analysis  of  two 

variable  rational  approximants,  (In  manuscript). 

10  Nuttall,  J.  and  S.R.  Singh,  Extension  of  proof  of 

convergence  of  Pade  approximants  for  a class  of 
functions  with  branch  points,  (In  manuscript). 

11  Watson,  P.J.S.,  Two-variable  rational  approximants: 

a new  method,  J.  Phys.  A : Nucl.  Gen.,  1_  (1974), 
L167-L170 . 


Charles  K.  Chui* 
Department  of  Mathematics 
Texas  A&M  University 
College  Station, 

Texas,  77840 


*Supported  in  part  by  the  U.  S.  Army  Research  Office  under 
Grant  Number  DAHC04-75-G-0186 . 


115 


DEGREE  OF  APPROXIMATION 
Ronald  A.  DeVore 

We  examine  techniques  for  estimating  the  degree  of  approx- 
imation of  a function  in  terms  of  its  smoothness.  This  is  done 
in  the  settings  of  polynomial  and  spline  approximation  in  L , 
pointwise  and  local  estimates  in  C,  as  well  as  some  remarks*3 
on  constrained  approximation. 

1 Introduction 

A major  portion  of  the  theory  of  approximation  of  func- 
tions is  concerned  with  the  connections  between  the  structural 
properties  of  a function  and  its  degree  of  approximation.  The 
objective  is  to  relate  the  smoothness  of  the  function  to  the 
rate  of  decrease  of  the  degree  of  approximation  to  zero.  We 
are  interested  here  in  examining  these  questions  for  polynomial 
approximation,  both  algebraic  and  trigonometric,  as  well  as 
spline  approximation.  These  are  then  the  most  classical  set- 
tings where  the  results  are  the  most  penetrating  and  satisfy- 
ing. 

We  will  limit  ourselves  to  only  one  aspect  of  the  problem, 
namely,  the  derivation  of  estimates  for  the  degree  of  approx- 
imation in  terms  of  smoothness.  These  are  the  direct  estimates 
of  approximation  which  are  also  commonly  called  Jackson  Theor- 
ems, after  D.  Jackson,  who  first  obtained  such  results  for 
trigonometric  approximation.  We  will  measure  smoothness  using 
the  r-th  order  modulus  of  smoothness.  Estimates  given  for  the 
r-th  order  modulus  of  smoothness  automatically  include  lower 

order  estimates  in  terms  of  the  j-th  order  modulus  of  smooth- 
(k) 

ness  of  f , with  j+k<r. 

Our  desire  here  is  not  to  try  to  be  encyclopedic  or  sur- 
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vey  the  subject,  but  rather  we  emphasize  techniques  which  are 
simple,  general,  and  give  sharp  results.  We  strike  out  then  to 
give,  as  much  as  possible,  a unified  viewpoint  to  obtaining 
direct  theorems  of  approximation.  Wherever  possible,  we  seek 
linear  methods  to  do  the  job. 

Perhaps  the  most  useful  technique  in  deriving  direct  est- 
imates is  smoothing,  where  the  arbitrary  function  f is  approx- 
imated first  by  a smooth  function  and  then  the  smooth  function 
is  approximated  by  the  appropriate  polynomials  or  splines.  In 
such  considerations, the  actual  method  of  smoothing  is  not  im- 
portant, although  sometimes  it  is  desirable  to  do  the  smoothing 
with  linear  operators.  VJhat  is  important  is  the  connection  be- 
tween the  error  in  approximation  and  the  deterioration  of  the 

smoothness.  For  example,  if  wo  approximate  f by  a function  g 

• (f)  . . . 

with  g in  L , then  we  are  interested  in  how  small  the  norm 

P (r) 

of  f-g  can  be  made  compared  to  the  norm  of  g . One  way  of 

making  such  comparisons  is  through  the  Peetre  K-functional  as 

described  in  Section  2. 

There  is  a very  intimate  connection  between  the  Peetre  K- 
functional  and  the  more  classical  modulus  of  smoothness  as  is 
put  forth  in  Theorem  2.1.  From  this  theorem,  it  follows  that 
for  most  problems  in  the  degree  of  approximation,  the  K-func- 
tional and  the  modulus  of  smoothness  are  the  same.  Hence,  in 
seeking  estimates  for  the  degree  of  approximation,  we  can  just 
as  well  obtain  estimates  in  terms  of  the  K-functional.  This 
will  mean  that  it  suffices  to  prove  estimates  for  smooth  func- 
tions, which  can  significantly  simplify  matters. 

In  Section  3,  we  discuss  local  approximation  by  polyno- 
mials of  fixed  degree.  The  main  theme  is  to  examine  the  depen- 
dence of  the  degree  of  approximation  on  the  length  of  the  int- 
erval. This  is  easy  enough  to  obtain,  but  nevertheless  impor- 
tant as  building  blocks  for  spline  approximation.  Moreover, 
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in  some  sense,  even  polynomial  approximation  can  be  viewed  as 
piecing  together  local  approximations  at  the  expense  of  letting 
the  degree  of  the  polynomials  grow. 

We  discuss  spline  approximation  in  Section  4.  There  are 
several  ways  to  generate  good  spline  approximations.  We  restr- 
ict our  attention  to  two  of  them  given  by  V.  Popov  - Bl.  Sendov 
and  C.  deBoor  - G.  Fix  respectively.  These  approaches  expose 
readily  the  highly  local  nature  of  spline  approximation. 

Trigonometric  polynomial  approximation  is  discussed  brief- 
ly in  Section  5.  Here,  the  results  are  well  known  and  easy  to 
obtain.  The  coresponding  case  of  algebraic  polynomial  approx- 
imation requires  more  care  and  effort.  The  Jackson  estimates 
for  the  space  c[j-l,l]  can  be  gotten  simply  by  reverting  back  to 
the  periodic  case  via  the  usual  substitution  x*cos  9.  We  how- 
ever give  a direct  construction  in  the  algebraic  case  since 
this  carries  over  to  general  L . This  is  done  in  Section  6. 

A satisfactory  description  of  algebraic  polynomial  approx- 
imation requires  the  use  of  pointwise  estimates  since  the  app- 
roximation can  be  improved  near  the  end  points  of  the  interval. 
We  look  at  such  pointwise  estimates  in  Section  7 for  c|j-l,lj. 
Here,  we  find  it  useful  to  revert  back  to  the  trigonometric 
case  since  all  the  constructions  can  be  made  with  modifications 
of  the  classical  Jackson  operators. 

In  Section  8,  we  look  at  the  constrained  problem  of  mono- 
tone approximation.  While  the  addition  of  the  constraint  makes 
the  problem  much  more  difficult,  some  of  the  general  lines  of 
attack  still  carry  over. 

A word  about  proofs.  In  all  instances  at  least  a sketch 
of  the  proof  is  given  --  the  most  sketchy  being  for  monotone 
approximation.  In  the  absence  of  complete  details,  there  is  of 
course  an  appropriate  reference  to  the  literature.  We  use  for 
notational  convenience  the  convention  that  C always  denotes  a 
constant  although  not  always  the  same  one  even  in  a given  line. 
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The  exact  dependence  of  C on  other  parameters  is  always  explic- 
itly given. 


2 Moduli  of  smoothness  and  the  K-functional 


Let  I- [-1,1]  and  for  l<p<°°,  let  L^(I)  denote  the  set  of 
real  valued  p-th  power  integrable  functions  on  I,  with  ||*||  (I) 

the  usual  L norm  on  I.  Similarily,  let  L (2tt)  denote  the  set 
P P 

of  2tt  periodic  real  valued  functions  which  are  p-th  power  int- 
egrable on  [j-ir.iT]  and  jj*||  (2ir)  the  L^  norm  on  [-ir,ir].  The 

spaces  C(I)  and  C(2tt)  are  defined  likewise  with  the  L norm. 

00 

We  use  the  notation  ||*||  without  the  I or  2ir  designation  when 
there  is  no  possibility  of  confusion. 

If  feL  (2tt),  h>0,  and  r is  a positive  integer,  we  define 
the  r-th  order  modulus  of  smoothness  of  f as 


(2.1) 


r >P 


(f,h)  *=  sup  !| A J (f , x)||  , 


o<t<h 


• r • . 

with  Ap(f)  the  r-th  forward  difference  of  f with  step  size  t. 

Throughout,  we  keep  the  convention  that  norms  are  taken  with 

respect  to  the  variable  x.  For  feC(2ir),  we  have 


(2.2)  wr(f,h) 


to  (f,h) 
r ,°° 


sup  H A (f,x) 
o<t<h 


In  L (I),  the  modulus  of  smoothness  needs  a little  more 
P 

care  in  its  definition.  When  t>0,  let  I - [-1 , 1-t]  . Then,  for 

feL  (I),  define 
P 


(2.3) 
Here , 


w (f,h)  * sup  ||  A (f  *x)||  (I  ) • 
,P  o<t<h  c p rz 


(I  ) indicates  that  the  L norm  is  taken  over  the 
P rt  p 

interval  I . This  notational  convention  will  be  used  through- 
rt 

out  the  paper.  For  feC(I),  we  have 

(2. A)  « (f,h)  - (o  ^(f.h)  - sup  ||A[(f,x)||  (I  ). 
r r»  o<t<h  c rt 


120 


DEGREE  OF  APPROXIMATION 


The  fundemental  role  of  higher  order  moduli  of  smoothness  in 
accurately  describing  degree  of  approximation  was  first  recog- 
nized by  A.  Zygmund  [36]  in  his  characterization  of  functions 

approximated  with  order  0(n  as  the  set  of  functions  with  4 

to 2 ( ^ » t ) = 0(t).  Actually,  a modulus  of  smoothness  can  already 
in  its  very  definition  be  considered  as  a measure  of  the  degree 
of  approximation  of  f by  certain  combinations  of  its  trans- 
lates. This  of  course  is  not  a very  interesting  approximation 
process  because  the  approximants  are  no  simpler  or  smoother 

« 

than  the  original  function.  However,  the  modulus  of  smoothness 
is  also  fundementally  connected  with  approximation  by  smooth 
functions,  as  can  be  brought  out  best  with  the  aid  of  the 

I 

Peetre  K-functional . 1 

If  r is  a positive  integer  and  l<p<0°,  let  W (I)  be  the  I 

(r-l)~  r,p  . 1 

set  of  those  functions  f,  with  f absolutely  continuous  , 

C r ^ . i 

and  f in  L (I).  W ( 2tt ) is  defined  similanly.  For  fe 
P r » P 

L (I),  l<p<®,  define  . 

P 

(2.5)  K (f,h)  - inf  {||f-g|!  + h!!g(r)||  }. 

,p  gewr  (I)  P P 

r »P 

For  feC(I),  we  have 

(2.6)  K(f,h)  - K (f,h)  - inf  {\\f-g\\m  * hl|g(r)||  } . 

r r’  -(« 

1 * 

For  a periodic  function  f,  the  K-functional  K (f,h)  is  de- 
fined  similarily.  The  K-functional  measures  how  well  f can  be 
approximated  by  smooth  functions  g with  a control  on  the  size 

of  l|g(r,|lp- 

For  most  purposes,  the  K-functional  and  the  modulus  of 
smoothness  are  equivalent  for  measuring  the  degree  of  approx- 
imation. Indeed,  we  have  the  following  theorem  of  J.  Peetre 

[27],  H.  Johnen  [21]  in  the  non-periodic  case,  (see  also  ( 

G.  Freud  [l7] , and  G.  Freud  - V.  Popov  [l9]  for  the  non-per- 
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iodic  case  and  p«°°) . 


THEOREM  2.1.  There  are  constants 
r,  such  that  if  feL  (I)  (or  feL  (2ir)),  1 

p — p 

feC(2TT)),  p«°,  then 

(2.7)  C u>  (f,t)  < K (f,tr)  < C d 

1 r,p  - r,p  - 2 r,p 


> 0 depending  only  on 
<p<°°,  and  feC(I)  (or 


(f,t),  t>0. 


Proof.  The  lower  estimate  in  (2.7)  is  shown  by  using  standard 

estimates  for  r-th  differences.  Let  M denote  the  B-spline  of 

r 

order  r (degree  r-1)  with  knots  at  0,1,..., r (see  Section  4). 

Then,  M is  non-negative,  vanishes  outside  of  (0,r),  and  we 

r • 

take  a normalization  so  that  / M (t)dt  * 1 . M is  the  Peano 

or  r 

kernel  for  r-th  differences  (see  [9]  ) at  the  points  0,1,..., r 

Hence,  if  geW  (I), 

^ > P 

00 

(2.8)  A*(g,x)  - tr  / g^  (x+u)Mf(t  *u)t  *du. 


Using  Minkowski's  inequality  for  integrals  and  the  fact  that 

es< 

(r). 


M^  has  integral  one,  we  can  estimate  for  any  geW^  p(I) 


ll^(g,x)||p(Irt)  < tr||g 

When  feL  (I),  t>0,  and  g is  an  arbitrary  function  from 
P 

W (I),  then 
rtP 

ll^(f,x)||  (Irt)  < ||A*(f-g,x)!|  (Irt)  + ||A'(g,x)||  (Irt) 


< 2r{|!f-g||p  ♦ tr  ||g(r)| 


Taking  an  infimum  over  all  g on  the  right  hand  side  and  a sup- 
remum  over  all  0<t<h,  we  find 

d (f ,h)  < 2rK  (f ,hr) . 
r.P  r»P 

The  same  reasoning  goes  over  in  the  periodic  case.  Thus,  we 
have  proved  the  lower  estimate  in  (2.7). 

The  other  half  of  the  inequality  (2.7)  requires  the  const 
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ruction  of  a smooth  approximation  to  f.  This  is  simplest  to  do 

in  the  periodic  case.  To  see  the  idea,  consider  first  r*l . We 

can  use  Stecklov  averages.  Fix  t>0  and  for  feL  (2tt)  define 

P 

_x  t 

g(x)  ■ t / f(x+u)du. 
o 

Then, 

, t 

l|f-g!L  i t / I|f(x+u)-f(x)||  du  < u>  (f,t). 

P Q P 1 »P 

Also,  if  F is  a primitive  of  f,  then  g(x)  = t * (F (x+t)-F (x) ) , 
and  so  g' (x)  * t ^ (f (x+t)-f (x) ) . Hence, 

tig’ll  - II  f (x+t)-f  (x)||  < u (f,t). 

p p l»p 

Putting  these  last  two  inequalities  together  shows  that 

Ki  n(f»t)  1 llf~glL  + t ||g'||  < 2m  (f,t),  t>0,  feL  (2tt). 

l,p  - p p - l,p  p 

For  the  general  case  of  r,  we  use  a type  of  higher  aver- 
ages. Let  feL  (2tt)  and  t>0,  and  now  define 
P 

00 

(2.9)  g (x)  - / { (-l)r+1A^(f ,x)+f (x)}Mr(t  1u)t  Tdu 

—O0 

00 

■ f(x)  + (-l)r+*  / A^(f,x)M^(t  *u)t  du. 

— oo 

Recalling  that  is  supported  on  (0,r)  and  has  integral  one, 
we  find  that 

00 

(2.10)  ||f-g||  < / ||A^(f,x)||Mr(t'1u)t“1du 

< / llAtu(f,X^pdu  - “r.p^ ,rt^  ’ 

Here,  we  changed  variables  to  return  to  (0,r)  and  used  the  fact 
that  Mr(u)  < l,for  all  u,  in  estimating  the  second  inequality. 
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To  estimate  the  r-th  derivative  of  g,  let  be  a primi- 
tive of  f and  in  general  a primitive  of  F^  for  k>2.  Then 
for  a typical  term  in  g,  we  have 


/ f(x+ku)M^(t  ^u)t  ^du 


00 

/ f (x+u)M  ((kt)  1u)  (kt)  ^du 

— oo 


t'rA.  -,(Fr,x), 


■* « ’ 

kt 


It  follows  that  the  L (2^)  norm  of  the  der- 

” 'r 


because  of  (2.8). 

ivative  of  such  a term  can  be  estimated  by  t 

• • • (r) 

This  gives  the  following  estimate  for  g 
(2.11)  t1]!  g(r)| 


< (2r-l)  max  '|A.r  (f,x)H  < (2r-l)m  (f,rt). 

P ~ l<k<r  kt  P " r*P 


The  inequalities  (2.10)  and  (2.11)  combine  to  show  that 

K (f,tr)  < I!f-g|l  + tr!!g(r)||  < 2roi  (f ,rt)  < Cm  (^t) 

r,p  P P “ r.P  - r,p 

with  C a constant  depending  only  on  r.  This  is  the  right  hand 
side  of  (2.7)  for  periodic  functions. 

The  right  hand  side  of  (2.7)  for  I is  a little  more  dif- 
ficult to  obtain  due  to  the  lack  of  periodicity.  There  are 
several  possible  approaches.  One  way  is  to  extend  the  function 
f to  a larger  interval  and  then  use  the  same  approach  as  we 
have  just  given  for  the  periodic  case.  This  was  done  by  G. 
Freud  - V-Popov  [i 9 J , for  the  case  that  p”°°,  with  an  extension 
that  depends  on  t . It  is  also  possible  following  H.  Whitney 
[33  J and  0.  Besov  [3]  to  extend  f to  a larger  interval  by  using 
a linear  operator  T and  retaining  the  smoothness  of  f.  This 
extension  will  be  discussed  at  the  end  of  this  section. 

It  was  recently  pointed  out  to  me  by  H.  Johnen  and  K. 
Scherer  how  a simple  modification  of  the  technique  given  in  the 
periodic  case,  already  handles  the  non-periodic  case  as  well. 
This  avoids  completely  the  problem  of  extending  f.  In  order  to 
point  out  this  proof,  let  us  first  record  what  can  be  shown 
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using  exactly  the  same  reasoning  as  given  above  for  the  period- 
ic case. 


LEMMA  2.1.  Lf  feLp (I) , t>0,  and  [a,b+r  t]  Cl,  then  with  J* 

fa,b],  there  is  a function  geW  (J)  such  that 
r,p 

(2.12)  ||  f-g||  p(J)  < / ||  A*u(f  ,x)||  p (J)du  < Cmr  p(f,t), 


(2.13)  tr||g(r)||  (J)  < (2r-l)  max  11*5.  (f  ,x)|l  (J)  < Cm  (f,t), 

p l<k<r  kt  p " r»P 

with  C £ constant  depending  only  on  r.  Similarly,  if  [a-r  t,b] 

C_ I , then  there  is  a function  g s£  that  (2.12)  and  (2.13)  hold 

• r r r*  t 

with  replaced  by  A ^ rn  (2.12)  and  A^  replaced  by  A_^t 

in  (2.13). 

Proof . The  proof  is  exactly  the  same  as  in  the  periodic  case 

with  (2.12)  coresponding  to  (2.10)  and  (2.13)  coresponding  to 

2 

(2,11).  The  function  g is  defined  as  in  (2.9)  when  [a,b+r  t] 

• r . . t • 

$ I and  as  in  (2.9)  except  that  A_  is  used  in  place  of  A in 

2 u u 

the  case  that  [a-r  t,b]<^I. 


To  obtain  the  estimate  for  the  K-functional  from  (2.12) 

and  (2.13),  let  if  be  a function  with  (Kx^O,  4>(x)=l, 

£<x<l,  0<<j>(x)<l,  xel,  and  l!$^ll  <C,  i*0,l,...,r,  with  C dep- 

2 

ending  only  on  r.  Let  2r  t<l,  g^  be  the  function  guaranteed  by 
Lemma  2.1  for  the  interval  J »=|-1,-£|  and  g2  the  coresponding 
function  for  the  interval  J I — i » 1 I • The  function  g = ( 1 — d> ) 

+ <Pf>2  = gx  + <f'(g2-g1)  satisfies 

(2.14)  ||  f-g|p  < ! f~g1llp(J1)  + l|f'g2llp(J2)  ^ C“r,p(f*t)» 

because  of  (2.12). 

( r)  • • 

We  need  also  an  estimate  for  g . The  crucial  interval 
is  J^*=  where  we  have 


(r), 


P(J3)  i 


(r), 


(J.) 

P 1 


max  || 
o<i<r 


r(i)-K(i)' 
>1  o' 


(J,)} 


1 P 3 


125 


RONALD  A.  DeVORE 


; c{IUlr)llpw,)  * lU^' 

< C{||  ♦ ||  gW 


H p<J2)  * ^8l'82!!p(V( 

M (J«)  + max  H f-g.  I|  (J  )}. 
P ^ i=l,2  p 


Here,  we  have  used  the  well  known  fact  that  for  any  h defined 
on  J3,  we  have  l|  h^1^  ))  p (J3)  < c{|[hj|p(J3)  + |)  h(r^  ||  ^ (J3> } , 
1 < i < r with  C depending  only  on  r. 

Our  inequality  for  g clearly  also  holds  on  [-1,-£J 
and  [$,l],  Hence, 


(2.15)  tr|!g(r)|!  <C  max 

P 1 


if-gJIjOi)  * l!g?r>l 


Pwi» 


< Co)  (f  ,t)  , 

r,p 


where  we  have  used  (2.12)  and  (2.13).  The  inequalities  (2.14) 

(r) 


and  (2.15)  show  that  for  r^t 


k _(f,tr)  < I!  f-g |!  + tr M g 

r » P P 


< Cti j ( f , t ) . 

p - r ,p 

. . 2 
This  inequality  automatically  also  holds  for  2r  t<l  because 

K is  sub-additive  and  00  is  monotone.  Thus  we  have  proved 
r,p  . r,p 

(2.7)  in  the  non-periodic  case  as  well. 


In  our  above  proof,  we  could  actually  have  given  a finer 
estimate  in  that  we  could  have  used  the  middle  estimates  from 
(2.12)  and  (2.13).  This  gives  the  following  lemma. 

_2 

LEMMA  2.2.  If  feL_ (I) , 0<t<i  r , then  there  is  a geW_  _(I) 
— p — — r»P 

with 

(2.16)  ||f-g||  < C j' II  Atu(f  ,x)  I!  p(Irtu)du 

v o p 

(2.17)  tr||g(r)|!  <C  max  II  ^ (f  ,x)  ||  _(Irkt) 

p o<k<r  v 

* /r|Uju(£.x)||  (Irtu)du  . 

O r 

with  C depending  only  on  r. 
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Proof . Let  g = (l-<j>)g^  + ^g^  as  above  and  estimate  llf-gjl  as 

in  (2.14)  except  now  use  the  middle  term  in  (2.12).  Note  that 

a[  (f,x)  = (— 1 ) r Ar  ( f ,x-rs)  . To  estimate  II  g^r^  |!  , we  argue  ex- 

s s p 

actly  as  in  (2.15)  except  that  we  now  use  the  middle  terms  of 
(2.12)  and  (2.13). 


Many  of  the  approximation  processes  developed  in  the 

following  sections  will  show  how  polynomials  and  splines  can 

also  be  used  as  the  function  g in  the  estimate  of  Kr(f,t)  in 

terms  of  the  modulus  of  smoothness.  The  Jackson  estimates 

(r) 


handle  the  term  ||  f-g||  . The  term 


g 


is  estimated  in  a 


way  similar  to  the  usual  proofs  of  the  Bernstein  inverse 
theorem  for  approximation  by  trigonometric  polynomials  [22] . 

Let  us  point  out  some  uses  of  the  K-functional . Suppose 
that  we  want  to  approximate  functions  from  L^(I)  or  Lp(2n)  by 
elements  from  some  set  A such  as  polynomials  or  splines.  Then, 
it  is  enough  by  virtue  of  Theorem  2.1  to  estimate  the  approx- 
imation for  smooth  functions  geW  (I)  or  respectively  W (2ir). 

& r,p  v r,p 

THEOREM  2.2.  Let  e>0,  l<p<®.  Suppose  that  for  each  geW^  ^(1) 
there  is  an  element  a eA  such  that  |lg-aj!_  1 C,  er||  g^r^|| 


g 


g"P  - 1 


with 


C^>1  depending  only  on  r.  Then,  for  each  feL^(I),  l<p<°°,  £r 
feC(I),  p=°°,  there  is  an  aeA  such  that 


llf-a!!p  5 c2“r,p(f’e)» 

with  C2  depending  only  on  r.  A similar  result  holds  in  the 
periodic  case . 

Proof . We  consider  only  the  case  of  L^(I).  The  other  cases 

are  similar.  If  feL  (I)  and  g is  any  function  in  W (I)  with 

P r »P 

a^  as  in  the  hypotheses,  then 

lf-a  IL  < llf-sll  * II g-a Jl  < C {|| f “gll  * ^ llg(r)|IJ. 


g'P  ” 


g'p  - 1' 


By  the  definition  of  the  K-functional,  we  can  choose  g approp- 
riately so  that 
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f-“g\  2ClKr,p(t-Er>i  C2"r,p<f'E>' 
where  we  have  used  Theorem  2.1  in  our  last  inequality. 


If  the  approximation  is  given  by  a linear  operator  L,  then 
Lf  will  provide  the  desired  approximation. 

THEOREM  2.3.  Let  l<p<°°.  Suppose  L J^s  ja  bounded  1 inear  opera- 

tor  from  Lp(I)  (C(I),p=®)  into  itself.  If  e>0  and  for  each  g 

in  W _(I)  we  have  ||g-Lg|!  < C.e*  llg^ll  with  C.>1  depending 

r,p  p - 1 p 1-  — 

only  on  r.  Then  for  each  feL  (I)  (C(I),p=°°)  we  have 


||f-Lf||  < C u)  (f , e)  , 

F ^ ^ > F 

with  C 2 a constant  depending  only  on  r and  H lM . A similar  re- 
sult holds  in  the  periodic  case . 

Proof.  We  again  only  consider  L (I),  l<p^°°.  The  other  cases 

P 

are  the  same.  If  feL  (I)  and  g is  any  function  in  W (I),  we 

P r »P 

have 


llf-Lf||p  < !|f-g||p  + Ilg-Lg||p  + )|Lf-Lg|lp 

< ClCI*l|lJI){l|F-g||p  * Pr  llg(r)Hp}. 

Taking  now  an  infimum  over  all  g and  using  the  definition  of 
the  K-functional  as  well  as  Theorem  2.1,  we  find 

||  f~Lf|!  < CK  (f,er)<  C.w  (f ,e)  , 

p - r ,p  - 2 r,p  ’ ' 

with  depending  only  on  r and  |IlI|  . 

When  the  approximation  is  given  by  a linear  operator  and 
p*°°,  we  can  even  give  pointwise  estimates. 

THEOREM  2.4.  Suppose  L j_s  £ bounded  linear  operator  from  C(I) 

into  itself  and  e(x)>0,  -1<x<1.  ^f  C^>1  i_s  such  that  for  each 

geW  (I),  we  have 
r,°° 

|g(x)-L(g,x)  | < g ^|| (x) ) r , -1<x<1. 

Then  for  each  feC(I) , we  have 
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| f (x)-L(f  ,x)  | < c2u)r(f  ,e(x))  , -1<x<1, 

with  depending  only  on  r and  IIlII  . A similar  result  holds 
for  C(2n) . 

Proof . Fix  x and  consider  feC(I)  and  g an  arbitrary  function 
in  W (I).  Then, 

r 00  * 

1 » 

| f (x)-L(f ,x) I < ! f (x) — g (x) | + | g(x)-L(g,x) | + lL(f-g,x)! 

< (1+|L|  )||  f-gll^  + C1l|g(r)|!oo(e(x))r 

< c1(i-H!l|! )f!!f-g|!oo+  (e(x))rl!g(r)i|oo}. 

Taking  now  an  infimum  over  all  such  g and  using  the  definition 
of  the  K-  functional  and  Theorem  2.1,  we  find 

I f (x)-L(f  ,x)  | < CKr(f,(e(x))r)  < C2oir(f  ,e(x)), 

with  C2  depending  only  on  r and  ||l||  . Since  x was  arbitrary,  we 
have  proved  the  theorem. 

These  last  three  theorems  will  enable  us  in  what  follows 
to  just  prove  our  estimates  for  smooth  functions.  We  would 
have  to  do  this  in  any  case  but  in  most  instances  this  is  made 
simpler  with  added  smoothness. 

Let  us  give  another  easy  application  of  Theorem  2.1  by 
showing  how  a function  can  be  extended  from  one  interval  to  a 
larger  interval  with  no  loss  of  smoothness  as  measure  by 
The  method  of  extension  was  given  by  H.  Whitney[33]  . Let  J« 
[0,1]  and  as  usual  I*[-l,l].  We  will  examine  the  extension 
from  J to  I. 


If  feL  (J)  (C(J)),  then  define 
P 


where  the  numbers  c.  satisfy 

1 1 
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(2.19) 


l 

i»o 


c.C-2'V  - 1, 


j“0.1 


The  function  Tf  is  an  extension  of  f to  L (I)  (C ( T ) ) . It  is 

P 

clear  that  T is  a bounded  linear  operator  from  L (J)  to  L (I) 

P P 

(C(J)  to  C(I)).  The  condition  (2.19)  guarantees  that  polynom- 
ials of  degree  r are  extended  to  the  same  polynomial. 

If  geW^  p(J)»  then  TgeW^  p(I)  since  the  only  point  in 
question  is  x«0,  where  we  have  continuity  for  each  function 
(Tg)^\  j*0, 1 , . . . , r-1  because  of  (2.19).  Also,  it  is  clear 


that 


(Tg) 


(D, 


(I)  < C Mg 


(r)| 


(J)  with  C depending  only  on  r. 


Hence,  if  f is  in  Lp(J)  (C(J))  and  g is  an  arbitrary  function 

in  W (J),  then  for  t>0, 
ftP 

||A*(Tf,x)||p(I)<  l|A^(T(f-g)  ,x)|l  (I)  + |lA^(Tg,x)Hp(I) 

< 2r  ||T(f-g)||p(I)  + tr  H (Tg)  (r)|! p (I) 

< C{|lf-g|l  (J)  + tr  "g(r)l|  (J)}  . 

- t p P 

Taking  now  an  infimum  over  all  such  g on  the  right  side  and  a 
supremum  over  all  t<h,  we  have  from  Theorem  2.1  that 

u (Tf  ,h)  < CK  (f,hr)<Cm  (f,h). 
r,p  - r,p  - r,p 

This  shows  that  T preserves  the  smoothness  of  f. 

THEOREM  2.5.  If.  [a.b]  C [c,d],  r>l,  and  l<p<°°,  then  there  is 
£ linear  operator  T mapping  Lp  [a ,b]  into  Lp  fc  ,d] , l<p«» , and 

c[a,b]  into  c[c,d],  p*00,  such  that  when  feLp[a,b]  (feC[a,b]), 
then  Tf  (x)  ■ f(x),  xc[a,b]  and 

(2.20)  u (Tf ,h)  < Cm  (f.h),  h>0, 

r.P  “ r,p 

wi  th  C a_  constant  depending  only  on  r and  the  two  intervals . 
Proof.  We  have  seen  this  theorem  for  extending  from  [o,l]  to 
[-l,l1.  The  same  idea  gives  an  extension  from  [-l,ol  to  F-l,ll. 
These  two  results  can  then  be  combined  with  usual  linear  change 
of  scale  to  give  the  general  result. 
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We  have  seen  a few  ways  that  the  K-  functional  can  be  used 
for  approximation  related  questions.  The  first  general  descrip- 
tion of  the  role  of  the  K-functional  in  approximation  was  given 
by  J.  Peetre  [27]  (see  also  [28]).  p.  L.  Butzer  - H.  Berens  [7] 
and  H.  Berens [2]  develop  the  uses  of  the  K-functional  in  pro- 
blems relating  to  the  degree  of  approximation  , especially  for 
semi-groups  of  operators.  A general  treatment  for  direct  and 
inverse  theorems  of  approximation  is  given  by  P.  L.  Butzer  - 
K.  Scherer [8],  The  K-functional  seems  to  have  had  some  of  its 
earliest  and  most  powerful  uses  in  spline  approximation  as  for 
example  already  in  R.  Varga [35]  and  K.  Scherer [3l] . We  will  see 
many  more  uses  of  the  K-functional  approach  in  the  following 
sections . 

3 Local  Approximation  by  Polynomials  of  Fixed  Degree 

We  want  to  estimate  the  error  in  approximating  f on  an  in- 
terval [a,b]  I by  polynomials  of  degree  < r-1  in  terms  of  the 
r-th  order  modulus  of  smoothness.  Thus,  in  distinction  to  the 
Jackson  theorems,  here  the  degree  is  fixed.  The  importance  of 
obtaining  such  estimates  is  twofold.  First,  spline  approxim- 
ation can  be  viewed  as  a smooth  joining  together  of  local  poly- 
nomial approximations.  This  means  our  estimates  will  be  useful 
in  the  next  section.  But  beyond  the  spline  approximation,  loc- 
al polynomial  approximation  will  give  another  view  of  the  con- 
nections between  degree  of  approximation  and  smoothness. 

THEOREM  3.1.  I_f  f eL^  [a , b]  and  6mfar  ^ 1 b-a  I , then  there  is  a_ 

polynomial  Q o_f  degree  r-1 , such  that 

r 

(3.1)  H f -Qll  [a, b]  < C{J  IhJ  (f.x)||  (J  )du 
v o p 

+ max  || AY.  (f ,x)||  (J  )}  < Cgo  (f,|b-a|), 
l<k<r  0K  p r°K  r,p 

with  ■ [a,b-t] , J ■ fa,bj , and  C a constant  depending  only 
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on  r. 

Proof . It  is  enough  to  prove  (3.1)  for  [a,b]  »I.  The  general 

_ 2 

case  follows  by  the  usual  change  of  variables.  So 

Suppose  that  feL  (I)  and  let  g be  as  in  Lemma  2.2  for  t»<5 . Con- 

. P (r-1) 

sider  the  polynomial  Q(x)  - g(0)+g' (0)x+* • * + (g  ' (0) / (r-1 ) ! ) • 

r-1 

x , which  is  the  first  r terms  of  the  Taylor  expansion  of  g 
about  0.  For  each  l<i<r  and  -l<x<i,  we  have 


tg(l"1)(x)-Q(l"l)(x)l 


1/  (g(l)(u)-Q(l)(u))du! 


1 llg(i)-Q(i)|lp, 


because  of  Holder's  inequality.  Hence, 

(i>  „(i)| 


ig(i-l).Q(i-l)|I  < 


g'  -Q 
(3.2) 


, i-1,2, . . . ,r.  This  gives  the  estimate 


g-q|lp  ; 


g 


(r)i 


< C6 


'g 


(r), 


Therefore,  if  we  estimate 


I g-Qll  and  use 
P 


f-olL  < •If-gIL  ♦ 

P P 

(2.16)  on  the  first  term  and  (2.17)  along  with  (3.2)  on  the  se- 
cond term,  we  obtain  the  estimate  (3.1)  as  desired. 

The  middle  estimate  given  in  (3.1)  is  more  sensitive  than 

w (f,'b-a  ) in  the  sense  that  we  can  add  up  such  estimates 

r,P  | . 

and  still  retain  an  inequality  of  the  same  form.  The  except- 
ional case  is  p*00  where  the  modulus  of  smoothness  suffices. 

In  the  case  p**°°.  Theorem  3.1  wa8  given  by  H.  Whitney  [34]. 
Whitney's  proof  is  more  complicated  but  includes  a sensitive 
estimate  of  the  constants.  The  proof  we  have  given  is  made 
simple  by  the  use  of  the  K-functional . 

The  estimate  (3.1)  is  of  course  an  estimate  for  the  degree 
of  approximation  in  terms  of  smoothness.  Actually,  these  est- 
imates can  be  easily  reversed.  Consider  the  simplest  case  when 
p«®.  Let  Er_j(f,J)  denote  the  error  in  approximating  feC(J)  in 
IHIJJ)  by  polynomials  of  degree  r-1.  From  Theorem  3.1  it  fol- 
lows that  if  t>0,  then 

(3.3)  sup { Ef_1(f,J)  : JCI,  | J | < rt}  < Cojr(f,t) 
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The  reverse  inequality  also  holds,  since  if  t>0  and 
^x,x+rtj  Cl,  with  0<  h<  t , J - [x,x+rh]  and  Q any  polynomial  of 
degree  r-1 , then  we  have 

U£(f,x)l  - | A^(f-Q.x)  | < 2rl|f-d'p). 

Taking  an  infimum  over  the  Q's  on  the  right  hand  side  and  a sup- 
remum  over  xe I , h<t  on  the  left  hand  side  gives  the  converse 
to  (3.3). 

If  J - [a.b]  and  is  any  bounded  projection  from  L^(J) 
into  the  space  of  polynomials  of  degree  < r-1,  then  the  follow- 
ing is  a simple  consequence  of  Theorem  3.1. 


COROLLARY  3.1.  There  is  a constant  C depending  only  on  r,  so 
“2  ' 

that  if  6*^r  | b-a [ and  feL  (J) , then  for  any  projection  , 

II  f-Pr  (f  )||  (J)  < C(l+||Prll){  / Ku(f,x)!!(Jr6u>du 


+ l<k<r!lA^(f,X),'p<Jr6k)^  1 Cu)r,p(f*lb-a'>* 

with  - [a,b-t]  , J - [a,b]  , and  C depending  only  on  r and  '|P^_||  t 

Proof . If  Q is  any  polynomial  of  degree  r-1,  we  have 

II  f-P  (f)ll  (J)  < |!f-o||  (J)  + |lp  (f-0)l|  (J) 

r p - p r p 

< (l+||Prll)||f-Q|lp(J). 

If  we  now  take  Q to  satisfy  Theorem  3.1,  we  obtain  the  desired 
estimate. 


4 Approximation  by  Splines 

Let  H - |x.\n  be  a set  of  knots  with  -1-x  <x  <**»<x  *1 . 

1 l Jo  o 1 n 

If  r>l,  then  denote  by  S (IT)  the  space  of  splines  of  order  r and 
knots  II.  Thus  Se^(IT)  if  and  only  if  is  continuous  and 

on  each  interval  (x.,x^+^1,  S is  a polynomial  of  degree  < r-1. 
When  r-1,  we  assume  left  continuity  at  each  knot  and  right  con- 
tinuity at  -1. 
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We  will  estimate  the  degree  of  approximation  by  splines 
from  S_(TI).  Our  estimates  automatically  include  estimates  for 
splines  with  multiple  knots  since  the  latter  splines  are  less 
smooth.  In  spline  approximation,  it  is  customary  to  measure  the 
degree  of  approximation  in  terms  of  the  mesh  ratio  In'  = 
max  (x^j-x.),  while  holding  the  order  r fixed. 

If  on  the  other  hand  the  knots  are  held  fixed  and  the  de- 
gree tends  to  infinity  then. this  is  akin  to  polynomial  approx- 
imation. When  both  r and  n tend  to  infinity,  then  one  can  to  a 
certain  extent  superimpose  the  results  from  the  other  two  cases, 
but  care  must  be  taken  in  the  estimation  of  the  constants.  W’e 
will  only  consider  the  case  where  r is  fixed. 

An  important  role  in  the  construction  of  spline  approxi- 
mants  is  played  by  the  B-splines.  These  splines  have  minimal 

support  and  are  a basis  for  § (n).  Define  x.  - -l-i(x  +1), 

r 1 1 

i<0,  and  x.**l  + (l-x  .),  i>n.  This  extends  the  knots  with  the 

1 . r-1 

same  sort  of  spacing.  Let  g^Csjx)  = (s-x)+  and  for  -r<i<n, 

define 


(4.1)  N.^(x)  - (xi+r-x.)gr(x. x.+r;x), 

with  the  notation  meaning  that  we  are  taking  an  r-th  divided 

difference  with  respect  to  the  variable  s while  holding  x fixed. 

The  spline  N.  is  non-negative  and  vanishes  outside  of 
i,r 

(x.,x.  ).  The  B-splines  N.  , -r<i<n,  form  a basis  for  £ (H) 

l l+r  r i,r  ’ r 

and  with  our  normalization,  we  have  J N.  (x)  - 1,  xel. 

-r< i<n  1,r 

To  construct  spline  approximants , we  can  follow  our  general 

plan.  In  other  words,  we  want  first  to  construct  approximants 

to  smooth  function  gcW  (I).  For  such  a function  g,  we  will 

r ,p 

show  that  there  are  splines  , j*l,...,r,  such  that  R.eJj(JT), 

l|g(r'j)-s.||p  i <r|n|)j||g(r)ll1>. 

Of  course,  we  need  only  the  case  j-r.  but  the  intermediate  re- 
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suits  are  used  since  the  proof  goes  by  induction. 

Let  us  sketch  the  construction  which  follows  the  ideas  of 
B.  Popov  and  Bl.  Sendov  [29|  , Subtracting  a polynomial  of  deg- 
ree r-1  if  necessary,  we  can  assume  that  g^^(-l)  ■ 0,  i»0,..., 
r-1.  Let  S be  the  step  function  in  S (H ) with  S (x)  - 

/IV  ^ 

g J(x.),  x^<x<x^+^.  Then,  by  Holder's  inequality, 

(4.2)  |g(r"1)-S1(x)|P  < { / 1 |g(r)(t)'dt  }P 

X . 

1 

- ' xi*i_xi I p/q  > 1,1  U<r)(t>'pdt. 

x . 

l 

when  x^<x<x^+^.  Here  q is  the  conjugate  index  to  p,  l/p+l/q«l. 
Integrating  this  last  inequality  over  [x.,x^+^],  replacing 
lx^  + ^-x^|  by  | II  | and  then  summing  over  i and  taking  a p-th  root 
gives  'lg^r  ^-Sjjlp  < "g^r^M  In',  which  is  the  case  j*l. 

Now  to  the  induction  step.  Suppose  that  S.eS.  (II)  satis- 
fies ||g(r_j)-S.||p  < (r  | IT  | ) ^ |lg(r)||p.  Let  X-[n/r]and  y^x^., 
i*0,...,X.  Define  for  0<i<X, 


a . . 
i.J 


/ 1 1{g(r_j)(t)-s .(t)}dt, 

y;  J 


-l 

c . . 
1,1 


N . . ( t ) d t. 
ri « J 


Note  that  N . . is  supported  on  (x  . ,x  . , . ) C (y . ,y . . , ) • The 
n,j  ri  n+j  - i i+l 

spline  defined  by 

x X-l 

S.  , (x)  - / {S . (t ) + 7 a.  . c.  . N . .(t)}  dt 

J+l  _[  1 J .“Q  i.J  i.J  n,j 


is  in  £.+1(IT)  and  satisfies  ||g^r  ^ ^-S. 


Hg'  - -Sj+i'!p  1 (r'nl) 


j+V 


(r), 


. This  is  shown  in  the  same  way  we  have  estimated  (4.2) 


r 

above,  except  that  now  we  use  the  fact  that  S.(y.)  ■ 

g^r  J ^ (y  ) t 0<i<X,  and  |y.+^-y.  | < r I IT  j , as  well. 

The  above  construction  shows  that  for  each  geW  (I), 

^ * P 

there  is  a spline  Sc  $ (IT)  such  that 
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(4.3)  I!  g-sl!  < (r'n|)r  l!g(r)ll  . 

p - p 

This  is  the  desired  result  for  smooth  functions. 

THEOREM  4.1.  For  each  feL^(I)  there  i s an  SeS^(H)  such  that 

(4.4)  |!f-sl|  < Cd  (f,!nl), 

P " r,p 

with  C a constant  depending  only  on  r. 


Proof.  This  follows  from  Theorem  2.2  and  the  estimate  (4.3). 

There  is  another  approach  to  obtaining  spline  approximants 

given  by  C.  deBoor  - G.  Fix  , which  we  also  want  to  look  at. 

Their  technique  yields  linear  projections  from  L (I)  to  £ (") 

p t 

which  give  the  estimate  (4.4). 

We  have  already  mentioned  that  the  B-splines  form  a basis 
for  £^(TT).  Thus  any  spline  Se$^(H)  can  be  represented  as 

/ a.(S)N.  , with  the  a.'s  linear  functionals  on  S (^)  and 

u i i , r i r 

a.(N.  ) = 6.  . (Kronecker  delta  notation).  There  are  explicit 

1 J,r  1,J  ri 

formulae  for  the  a^'s  given  in  [,4],  For  example,  if  t e 

(xrxj+r).  ^(x)  - (x.+1-x)-.-(x.+r_rx)  and 

(r-l)!n.  . - (-l)r"k_1  '?.(r"k'1)(T.)> 

1 » x J J 

then  the  functional  a^ , -r<j<n,  can  be  represented  by 

(4.5)  a (S)  - l n.  ,S(k)(T  ). 

J k<r  J ’ J 


It  is  clear  from  (4.5)  that  the  functionals  a.  have  sup- 
port on  (Xj,Xj+r).  We  want  to  bring  this  out  more  clearly  in 
a manner  in  which  we  work  with  S and  not  its  derivatives.  We 

are  still  at  liberty  to  choose  the  t.'s.  For  -r<j<n,  let  I. 

J J 

be  a largest  interval  of  the  form  (x.,x.+j)  contained  in  T A 

(x.,x,  ).  In  case  of  a tie  we  choose  I.  as  the  left  most  such 

1 J+r  j 

interval.  Our  selection  gives  that  I.£l  and  rll.l>lx.  -x . I , 

J J ~ J+r  J 

-r<j<n,  since  the  knots  have  a uniform  spacing  outside  of  I. 
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For  each  j,  let  T ^ be  the  midpoint  of  1^  . 

LEMMA  4.1.  Let  l^p<°°.  There  is  a constant  C that  depends  only 
on  r such  that  for  each  Se^(II),  we  have 

(4.6)  a j ( S ) | 5C!xj+r~Xj|  1/p  1|  SH  (I  j ) » ~r<j<n. 

Proof . First  observe  that  there  is  a constant  C depending  only 
on  r such  that  for  any  polynomial  Q of  degree  < r-1 , we  have 
ML  1 c|| Q||  < c||q||  • This  is  because  on  a finite  dimensional 

00  — I “ P 

space  any  two  norms  are  equivalent.  Changing  this  inequality 
to  (a,b)  gives 


(A.  7)  ML<«.b)  < C | b-a  | 1/P  Ml  (a,b). 

oo  — p 


Now  consider  the  representation  (4.5)  for  a.. 

formula  for  the  n.  ,'s,  it  follows  that 
J ,k 

(4.8)  |n.  , ! < (x  -x.)k,  -r<j<n,  0<k<r. 

J,k  J+r  j ~ 

Hence,  if  we  use  Markov's  inequality  with  (4.8)  and 

f ind 


r-1 


(S)|  1 [ (x.  -x.)K  IS(k)(T.) 


k-o 


J+r  J' 


From  the 


(4.5),  we 


< c||s|L(i;)  j 

k*o 


J 


X . -X  . 

J+r  j 


lkii. 

1 


-k 


ci,sUy> 


lx.  -x . I < r 1 1 . 
J+r  J - J 


with  C depending  only  on  r.  Recall  that 

Using  (4.7)  to  write  this  last  inequality  in  terms  of  the  L 

P 

norm,  we  arrive  at  the  desired  result  (4.6). 

The  functionals  a.  are  only  defined  on  S (IT)  but  we  can 
j r 

use  the  Hahn-Banach  theorem  to  extend  these  functionals  to  all 
of  Lp(I)  (C(I),p“<»)  with  the  preservation  of  the  estimate  (4.6). 
Because  a^  is  supported  on  I.,  it  has  a norm  preserving  exten- 
sion a.  to  L (I.).  Let  y.  denote  the  characteristic  func- 
J p J J 

tion  of  I.  and  define  a.(f)  = a.(f*Y.)  for  each  feL  (I).  Then 
J J J J _ P 

for  any  spline  S,  we  still  have  that  cu(S)  - a.(S’x.)  * a.(S). 
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For  simplicity  of  notation,  we  do  not  indicate  the  dependence 

of  the  extended  functional  a.  on  p. 

J 

The  functionals  a.  will  now  satisfy  for  ftL  (I)  (ftC(I), 
.1  P 

p=»~) , 


(4.9)  o .(f)  < dx.  -x.l 

1 " 1 +r  J 

Define  now  the  operators 


f II  (1 . ) , -r  <j  <n, 
P J 


(4.10)  L (f  x) 
IT.  P 


l a • (f  )N.  (x) , feL  (I)  (fc  C(I)  ,p-  »). 


-r<  i<  n 


These  are  projections  from  L (I)  onto  S (II).  These  operators, 

P r 

since  they  are  projections,  are  as  effective  as  any  spline  app- 
roximation provided  there  norms  are  controllable.  So  our  next 

task  is  to  check  the  norm  of  Ln 

n,P 

If  0<_j<n,  then  only  the  indices  i with  j — r<  i<  j contribute 

to  L (f,x)  when  x.<x<x.  Also,  the  N.  's  are  non-nega- 

n,P  ]~  ~ J+l  i,r 

tive  and  sum  to  one.  Therefore,  for  l<p<0°, 

x . , x . , 

j+l  j+l 

(4.11)  / !l_  (f,x)|Pdx  < max  'a.(f)IP  / dx 

n .p  “ . 1 

x.  i-r<i<i  x. 

J J 

< C max  { |x  -x.l  1 / |f(x)|dx  } |Xj+1“X.  | 
j~r<i<j  I. 


< C max  } | f (x) I Pdx , 

j-r<i<j  I. 

with  C depending  only  on  r.  Here,  we  have  used  our  estimate 
for  given  in  (4.9)  and  the  fact  that  |x.+1-x^|  < |x.+r~x. | 
j-r<i<j . 

Now,  for  each  j,  the  max  in  (4.11)  is  taken  only  over  r 
terms  and  for  any  given  interval  th^s  interval  can 

appear  at  most  r times  in  the  sequence  of  intervals  (Ij). 
Therefore,  we  can  sum  the  inequalities  (4.11)  over  0<j<r  and 
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(4.12)  II L (f )H  < C llfll  , feL  (I)  (feC(I),  p-«)  . 

n,p  p - p p 

We  have  actually  only  proved  the  case  l<p<«=  in  (4.12)  but  the 
case  p-03  is  the  same  sort  of  argument.  From  (4.12),  we  have 

(4.13)  ||L  ||  < C, 

II,  p - 

with  C a constant  depending  only  on  r. 


THEOREM  4.2.  Let  l<p<°°,  and  define  L^  as  in  (4.10).  Then, 

— *-,0 

there  is  a_  constant  C depending  only  on  r such  that 


(4.H)  l|f-injp«)llp  ; c»r_p(f,ln|). 

Proof.  If  S is  any  spline  in  §^(11), 
jection,  we  have 


feL  (I)  (feC(T),  p-«) . 
P 


then  since  L„  is  a pro- 
H,P 


l|f-L  (f)l|  < |lf-si|  + ||l 

n,p  p - p 

Now,  taking  S as  in  Theorem  4.1 
desired  result  (4.14). 


and  using 


< (1+|!  L J)  Hf-Sll. 

n , p p 

(4.13),  we  have  the 


Our  proof  of  Theorem  4.2  relied  on  Theorem  4.1  and  hence 
in  turn  on  Theorem  2.1.  However,  it  is  possible  to  prove  Theo- 
rem 4.2  directly  by  using  the  estimates  of  Theorem  3.1.  In 
this  way,  we  can  get  another  proof  of  Theorem  2.1.  Namely, 

take  n equally  spaced  knots  for  the  knot  set  II  . If  (n+1)  ^ 

-1  , n 
<t^n  , let  g = L (f)  for  r+1  in  place  of  r.  Then  g will 

n’p  . . r i 

give  the  crucial  right  hand  estimate  in  (2.7)  (see  [.  1 5j ) . 

In  the  case  p=<=°,  there  is  an  additional  benefit  to  arguing 
directly  since  then  we  can  get  local  estimates  for  the  approx- 
imation as  shown  in  the  next  theorem. 

THEOREM  4.3.  Let_  Ln  - L^  as  jjn  (4.10).  If  feC(I)  and 

x.<x<x.  , , 0<i<n,  then 
j-  - j+1  -J 

(4.15)  |f(x)-Ln(f,x)  | < Cur(f,!xj+r-x._r|), 

with  C a constant  depending  only  on  r . 

Proof.  Let  Q be  a polynomial  which  satisfies  (3.1)  for  p=°° 
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and  [a,b]  = (V _r » x j +rl • Then, 

(4.16)  [[  f— qI!  [x.  ,x.  1 < Coi  (f , lx.  -x.  |). 

' lo°L  j-r*  j+ra  - r * j+r  j-r1 

Now,  N.  (x)  = 0 if  ii  [j-r+1 , j]  , and  the  N.  's  sum  to  one. 
1 * r i,r 

Hence,  if  Xj<x<Xj+^, 

(4.17)  |Ln(f-Q,x)|  < max  |a.(f-Q)J  < C max  J I f — qI  ! (1^) 

j-r<i<j  “ j-r<ifj 


< c l|f-QlL^_r,xj+r], 


because  of  (4.9).  Since  preserves  polynomials  of  degree  r-1, 
we  have  from  (4.16)  and  (4.17) 


I Ln (f ,x)-f (x) | < |Ln(f-Q,x)|  + | f (x)-Q(x) j 

< C l|  f-Q||  [x.  ,x.  1 < Co)  (f , lx.  -x.  I), 

°°L  j-r  j+r-1  - r ’ j+r  j-r  ’ 

with  C depending  only  on  r.  Here,  we  used  the  fact  that  is 

bounded  and  (4.16).  This  is  the  desired  estimate  (4.15). 

~2  2 

Let  us  point  out  one  special  case.  Let  y^  = 1-n  (n-i)  , 

i=0,l,...,n  and  y_.=y.,  i=l,2,...,n.  Define  the  knot  set  H= 

fx.}  , with  x.  = y.  , i=0,l 2n.  These  knots  are  thicker 

1 ° 1.  1_n  -1  2 Jr  -2 

near  the  end  points.  If  we  let  = max{n  (1-x  )^,n  }, 

then  from  Theorem  4.3  it  follows  that 


(4.18)  | f (x)-Ln(f  ,x)  | < Coir(f  ,&n(x))  , -1<x<1. 

This  estimate  is  a spline  analogue  of  the  Timan  type  estimates 
for  algebraic  polynomial  approximation  (see  Section  7) . For 
splines,  such  pointwise  improvements  can  also  be  made  in  the 
interior  of  the  interval  by  allowing  the  knots  to  thicken  near 
a point  or  a finite  number  of  points.  Such  interior  improve- 
ments are  not  possible  in  polynomial  approximation  Til, p. 190], 
There  is  a much  larger  selection  of  coefficient  function- 
als a.  which  when  used  in  the  definition  of  L„  will  yield  the 

J n,p 

Jackson  order  estimates  of  Theorems  4.1  and  4.2.  A careful 

analysis  of  the  proof  given  for  p=  °°  in  Theorem  4.3  shows  that 
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we  do  not  need  that  L_  is  a projection  but  it  suffices  that 

H » P 

polynomials  of  degree  < r-1  are  preserved  and  estimates  like 
(4.6)  and  (4.9)  hold.  As  we  have  mentioned  already,  Theorem  4.2 
can  be  proved  directly  for  general  p and  here  again  the  direct 
proof  requires  only  that  L preserve  polynomials  of  degree 

ii , p 

1 r-1  and  the  inequalities  for  a..  This  leads  to  a greater 

variety  of  possible  choices  for  the  coefficient  functionals. 
These  ideas  are  examined  in  detail  in  T.  Lyche  - L.  Schumaker 


[25]. 


5 Approximation  by  Trigonometric  Polynomials 

There  are  several  ways  of  deriving  direct  estimates  for 
approximation  by  trigonometric  polynomials.  Let  us  consider 
only  the  most  common  method  which  is  to  use  some  sort  of  con- 
volution type  operator.  Suppose  that  (K^)  is  a sequence  of 
non-negative  trigonometric  polynomials  of  degree  n and  nor- 
malized to  have  integral  over  [— r,ir]  equal  to  one.  The  opera- 
tors L defined  by 
n 

(5.1)  Ln(f,9)  = / { (-1)  £^(f,9)  + f (0 )}  K (t)dt,  feL  (2tt) 

-it  n p 

will  yield  the  Jackson  estimates  provided  that  the  moments  of 

the  kernels  satisfy  certain  inequalities. 

We  begin  by  checking  smooth  functions  in  W^_  (2tt)  . When 

geW  ( 2tt)  and  M ■ |[  g^  ||  ,then 

r»P  ^ P w 

(5.2)  ||  g-Ln(g)||  < /!|  A^(g,0)||  K(t)dt  < M / |t|rK(t)dt. 

Therefore,  to  have  the  correct  order  of  estimates  for  func- 
tions in  W (2t r) , we  need  that  the  r-th  moments  of  the  K 's 

^ * P _ j-  ft 

be  of  order  0(n  ) . 

The  best  known  examples  of  kernels  satisfying  this  moment 

condition  are  the  Jackson  kernels  J (t)  = c (sin(mt/2))^r. 

-2r  . p ~i  n n,r 

(sin(t/2))  , with  m = In/rJftftd  c a normalizing  constant. 

u J n , r 

The  kernels  J^  satisfy  (see  [22,  p.  57]) 
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7T 

(5.3)  / 1 t !rJn(t)dt  < Cn  r, 

-IT 

with  C a constant  depending  only  on  r. 

THEOREM  5.1.  Let  1 < p < » and  (K^)  b£  a_  sequence  of  non-nega- 
tive trigonometric  polynomials  of  degree  < n which  satisfy 

TT 

(5.4)  / |t|rKn(t)dt  < Cn  r,  n - 1,2,... 

-it 

with  C a constant  depending  only  on  r.  Define  the  operators  L^ 
£s  iji  (5.1).  Then  for  each  n > 1 and  f eL^ ( 2 tt)  , L^  ( f ) i_££  trig- 
onometric polynomial  of  degree  < n for  which 

(5.5)  ||  f-Ln(f)||p  f Cu^ptt.n”1), 
with  C a constant  depending  only  on  r. 

Proof.  Because  of  (5.4)  and  (5.2),  for  any  geW_  _(2it),  we  have 

^ ,P 

II  g”Ln(g)  ^1  p < Gn  ||  g !|  p.  Thus  the  theorem  follows  from 

# r 

Theorem  2.3,  since  each  L clearly  has  norm  < 2 . 

n - 

6 Approximation  by  algebraic  polynomials 

The  usual  technique  for  proving  Jackson  type  estimates  for 

approximation  by  algebraic  polynomials  when  p=°°  is  to  revert 

back  to  the  trigonometric  case  by  using  the  substitution  x « 

cos  0.  This  is  particularily  useful  for  pointwise  estimates  as 

described  in  the  next  section.  However,  in  the  case,  l<p<°°, 

this  method  is  not  satisfactory  since  when  feL  (I),  the  func- 

P 2 -a 

tion  g (0 ) = f (cos  0)  need  not  be  in  L (2ir)  (e.g.  f(x)«(l-x  ) , 

a*  3/4p).  Instead,  we  will  work  to  construct  algebraic  analog- 
ues of  the  operators  in  Section  5. 

Let  us  first  note  that  there  are  algebraic  polynomials  yn 
of  degree  < n which  are  non-negative  and  satisfy 

1/r 

(6.1)  / w (t)dt  - 1, 

, < n 
-1/r 
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3 

(6.2)  / It'S  (t)dt  < Cn 

o n 


with  C a constant  depending  only  on  r.  Examples  of  how  to  con- 
struct such  polynomials  for  r*2  were  given  in[lo]  and[ll].  The 
same  approach  carries  over  to  general  r.  A sketch  of  the  con- 
struction goes  as  follows. 

Let  P„  be  the  Legendre  polynomial  of  degreee  2m  and  x.  » 
Ztr  i ,m 

i*l m its  positive  zeros  written  in  increasing  order.  The 

polynomials 


X (t) 


n 


P2„(t> 


, 2 2 , , 2 2 , 
(x  -x,  ) • • • (x  -x  ) 

l,m  r,m 


with  m ■ [n/4]  have  degree  < n and  with  a proper  choice  of  c , 


/ X (t)dt  - 1, 
-1  n 


/ 1 1 1 r>.  (t)dt  < Cn'r. 

-1  n 


The  estimate  of  the  r-th  moment  of  X can  be  done  by  the  Gauss 

n 

quadrature  formula  when  r is  even  and  by  using  the  Cauchy- 
Schwartz  inequality  and  then  the  Gauss  quadrature  formula  when 
n is  odd.  Now  switching  to  [-3,3]  and  adjusting  the  constant 
so  that  (6.1)  holds  will  give  polynomials  that  satisfy  (6.1) 
and  (6.2).  Note  that  the  integral  of  X on  [-l,l]  - [-l/r,l/r] 
is  0(n  ) because  of  the  moment  inequality. 

Now  to  the  construction  of  the  operators.  Because  the 
functions  are  not  periodic,  we  need  to  work  with  some  sort  of 
extension.  We  can  use  the  extension  introduced  at  the  end  of 
Section  2.  Take  [a,b]  ■ [-l,l]  and  [c,d]  * [-4,4]  and  let  T 
be  the  linear  operator  guaranteed  by  Theorem  2.5.  For  simplic- 
ity of  notation,  we  writ®  f in  place  of  Tf . 

For  each  l<k<r  and  -1<x<1, 

2 (2-x) /k 

(6.3)  / f(t)p  (k_l(t-x))k_1dt  - / f (x+kt)u  (t)dt 

-2  " n (-2-x) /k  " n 
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is  an  algebraic  polynomial  of  degree  < n.  Although  the  limits 
of  integration  of  the  term  on  the  right  hand  side  are  variable 
we  shall  soon  see  that  only  the  integral  over  [-6,6],  with  6 = 
1/r  is  significant. 

The  analogue  of  the  trigonometric  operators  (5.1)  are 

2 r . , , 

(6.4)  4>  (f,x)=  / f(t)<t>  (t-x),  <(>  (u)=  (-1)  (£)k  y (k  u) 

n ' - n n , n 


Using  (6.3),  ^(f)  can  be  written  in  terms  of  the  r-th  differ- 
ence of  f.  For  each  l<k<r,  we  have  (-2-x)/k  < -6  < 6 
< (2-x)/k.  Thus  the  right  hand  integral  in  (6.3)  can  be  writ- 
ten as  an  integral  over  [-6, 6]  and  a remainder  IL(x)  . Hence, 


(6.5) 


*n(f,x) 


= / { ("l)r+  A*(f,x)  + f(x)}  y (t)dt 

-6  n 

l W+1 

+ 1 (_1)  C J) (x)  = (f  ,x)  + B^(f  ,x)  . 


The  operator  A^  is  now  of  a form  akin  to  (5.1).  We  want 

now  to  estimate  B . Let  J , = [(-2-x) /k, (2-x) /k]  - [.-<5,6], 
n x f k 

and  J = { t : 6 < 1 1 1 < 3 } . For  each  l^k<r  and  -1<x<1,  J C.J 
and  so 

H*lJlp  " II  / f (x+kt)yn(t)dt||p  < ||f||p[-4,4]  / yn(t)dt 

J , J 

x,k 

1 llfllp  / (r|t|)ryn(t)dt  < Cn  r ||f||p, 


with  C a constant  depending  only  on  r.  In  the  second  to  last 
inequality,  we  used  the  fact  that  r|t|  >1  if  |t|  >6,  and 
the  fact  that  II  f II  [-4 ,4]  < c||f||p,  since  T is  a bounded  oper- 
ator. In  the  last  inequality,  we  used  our  moment  estimate 
(6.2).  This  shows  that 

(6.6)  ||Bn(f)||  < C ||f||  n“r,  feLp(I). 
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We  want  now  to  check  A . If  geW  (I),  then 

n r,p 

6 


(t}  r 

Here  we  have  used  the  fact  that  g=g  on  I,  ||g  '||  [-2,2]  < 

||g(r)||  [-4, 4]  < C llg^l!  , since  T preserves  smoothness. 

H p - P 

LEMMA  6.1.  Let  l<p<°°.  If.  n>l , define  4>  (6.4).  For  each 

feL^(I) , $n(f)  j_s  an  algebraic  polynomial  of  degree  < n and 

(6.8)  ||f-$  (f)||  1 Cfco  (f,n-1)  + !|f||n"r}, 

up  r , p p 

with  C £ constant  depending  only  on  r . 

Proof.  Because  of  (6.7)  and  Theorem  2.3,  for  each  feL  (I),  we 

“1  P 

have  |!  f-A  (f)lj  < Coj  (f,n  ).  We  have  already  seen  that 
n p - r,p  7 

||  B (f)||  < C ||  f ||  n r by  (6.6).  Since  $ = A + B , these  two 

n "p  " p J n n n 

facts  prove  the  lemma. 

We  can  remove  the  unpleasant  term  ||  f||  r by  a minor  ad- 
justment. Given  f,  let  F be  the  primitive  of  f with  F(0)*0 
and  let  Q be  the  polynomial  of  degree  r which  interpolates  F at 
the  points  -l+2ir  i«=0,l r.  Define  the  projection  P 

from  L (I)  onto  polynomials  of  degree  < r-1  by  P (f)  *=  Q'. 

P r 

Consider  now  the  operator 

(6.9)  L (f)  = <t>  (f-P  (f))  + P (f). 

n nr  r 

For  n>r,  L^(f)  is  still  a polynomial  of  degree  < n,  but  now  L^ 
preserves  polynomials  of  degree  r-1. 

THEOREM  6.1.  Let  l<p<®.  Define  Ln  ££  in  (6.9) . Then,  there 
is  £ constant  C,  depending  only  on  r,  such  that  for  n>r,  fe 
Lp(I),  L^(f  ) is  a polynomial  of  degree  < n with 

(6.10)  ||  f-L  (f)||  < Coj  (f,n_1). 

n p - r , p 
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Proof . We  have  from  the  definition  (6.9)  and  from  (6.8)  that 

(6.11)  ||  f-L  (f)||  <||<t  (f-P  (f))-(f-P  (f)|| 

n p - n r r p 

< Cu>  (f,n_1)  + Cn~r  ||f-P  (f)||  . 
r,p  r p 

The  last  term  can  be  estimated  by  using  Corollary  3.1  to  find 

n”r  ||f-P  (f ) ||  < Cn“rw  (f  ,2)  < Cco  (f,n_1), 

" r 1 p - r ,p  - r ,p 

where  we  have  used  the  usual  properties  of  the  modulus  of 

smoothness.  When  this  last  estimate  is  replaced  in  (6.11),  we 

obtain  (6.10),  as  desired 

In  the  case  that  r*l,  J.  Bak  - D.  Newman  [l]  have  shown 
the  existence  of  polynomials  which  when  replaced  for  L^(f) 
will  give  (6.10).  This  is  done  in  the  more  general  context  of 
MUntz-Jackson  theorems.  Suprisingly,  I could  find  no  reference 
to  the  general  case  of  r. 

7 Pointwise  approximation  by  algebraic  polynomials 

It  was  first  observed  by  A.F.  Timan  T32]  that  in  the  case 
p-°°,  the  approximation  of  f by  algebraic  polynomials  can  be  im- 
proved near  the  end  points  of  the  interval.  These  improvements 
are  necessary  if  we  want  a direct- inverse  theorem  characteriza- 
tion of  classes  of  functions,  e.g.  Lip  a,  0<a<l,  in  terms  of 
approximation  by  algebraic  polynomials.  J.  Brudnyi[.5l  has  ex- 
tended Timan' 8 results  to  arbitrary  by  showing  that  for  each 
feC(I),  r>l,  n>r,  there  is  an  algebraic  polynomial  P of  degree 
< n such  that 

(7.1)  | f (x) — P (x) | < Cmr(f,An(x)),  -l<x< 1 , 

with  C a constant  depending  only  on  r and  A (x)  ■ maxfn  * • 

2 i _2  ^ 

(l-x  )",  n }.  The  case  r-1  of  (7.1)  is  the  original  result  of 
Timan,  while  r-2  was  proved  independently  by  G.  Freud  [l8]  and 
V.  Dzadyk[l6].  Actually,  any  of  the  trigonometric  operators 
introduced  in  Section  5 lead  to  algebraic  methods  that  give 
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(7.1). 


If  we  want  to  establish  (7.1)  using  a sequence  (A  ) of  al- 


gebraic polynomial  operators,  then  according  to  Theorem  2.4,  we 

only  need  that  for  each  geW  (I), 

r,p 

(7.2)  !g(x)-An(g,x)!  < C(An(x))r  |lg(r)1L,  -1<*1, 


with  C a constant  depending  only  on  r. 

Let  (K  ) be  a sequence  of  trigonometric  polynomials  with 

K of  degree  n and  integral  one  over  rr  ,ir"l  , K >0  and 
n tt  . n 

(7.3)  / |t|JKn(t)dt  < Cn"J,  j -1 , 2 , . . . , 2r . 

— 7T 


For  example  the  K^'s  that  satisfy  (5.4)  with  2r  in  place  of  r 
will  satisfy  (7.3).  Define  L as 

* 9 

(7.4)  L (h,0)  « / {-A^r (h,0)  + h(9)}K  (t)dt,  hEC(2"). 

-it 


Denote  by  P (f)  the  polynomial  of  degree  r-1  which  interpolates 

y -1 

f at  the  points  -l+2ir  , i«l,...,r.  Then  for  feC(I),  define 

(7.5)  A (f,x)  ■ L (f(cos  B)-P  (f,cos  0),  cos  ^x)  + P (f,x). 
n n r r 

The  use  of  P^  is  to  guarantee  that  A^(Q)  - 0 whenever  0 is  a 

polynomial  of  degree  < r-1.  These  operators  will  satisfy  (7.2) 

as  we  proceed  to  show. 

The  case  r*l  is  the  easiest  to  verify.  If  'g'|<M,  a.e. 
and  h(0)  - g(cos  0),  then  | h * (0 ) | ■ ^sin  0|lg'(cos  0)1  < 

Mlsin  0 | • a.e..  From  this  one  easily  checks  that 

|A*(h,e)|  < |h(0+2t)-h(0  + t)  | + lh(0+t)-h(0)  I 
< 3M( ! sin0 | | t | + t^) . 

This  gives  that 

" 7 

|h(e)-Ln(h,0)|  < / |A  (h,0) |Kn(t)dt 

“TT 

TT  q 19 

1 3M  / (Ml  sin  0 | +t  )K  (t)dt  < CM{)sin0]n  + n } 

-IT 
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because  of  (7.3).  When  this  is  written  in  terms  of  A , we  get 

n 

(7.2). 

The  essential  property  that  allowed  us  to  give  the  point- 
wise  estimate  above  was  that  the  appearance  of  the  sin  9 in  h' 
could  be  turned  into  a profit.  The  following  lemma  is  the  same. 


LEMMA  7.1.  Let  L be  defined  as  in  (7.4)  with  K satisfying 
n 7-t n 1 “■ 

(7.3).  _If_  heC(2Tr)  wi th  ^h  (9)'  < A ^sin9l'J,  a.e.,  and  0<p<r, 
p+v  < 2r,  then 

^ L^  (h , 9 ) — h (9 ) j < CAn  V(lsin  9^  + n y),  — tt  <9  <tt  , 

with  C a_  constant  depending  only  on  r. 

Proof . The  proof  is  similar  to  that  given  above  for  r**l  . See 
[15]  for  d etails.  Similar  arguments  are  also  given  in  {.22} . 


Any  function  h(9)  = g(cos  9)  with  geW  (T)  can  be  decomp- 

r >°° 

osed  into  functions  which  satisfy  Lemma  7.1. 

LEMMA  7.2.  Let  L^  be  as  j_n  (7 .4)  . Tf  geW^  (I)  with 
< M,  i**0,l,,..,r  and  h(9)  = g(cos  6),  then 

(7.6)  |h(9)-L  (h,9)  l<  CM(An(cos  9))r,  -rr<9<7T, 


wi  th  C depending  only  on  r . 

Proof . It  is  enough  to  consider  the  case  when  M * 1.  We  can 
decompose  h into  a sum  of  functions  which  satisfy  the  hypothe- 
ses of  Lemma  7.1. 

The  decomposition  goes  as  follows.  Differentiate  h r 
(r) 


times  and  write  h 


h(r)  + R(r)  where  h(r)(9) 
o o o 


(-sin9)rg^r\cos9)  and  the  collection  of  the  remaining 

terms.  The  functions  h and  R are  defined  as  the  r fold  mean 

o o 

value  zero  integrals  of  h^r  and  , respectively.  This 

convention  holds  throughout 
,(r) 


Now,  R 


(i) 


has  only  terms  involving  g with  i<r  and  thus 


has  another  derivative.  Write  R 


(r+1)  (r+1)  _(r+l)  ... 

« h,  + R,  , with 

oil 
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h^r  ^ the  collection  of  terms  involving  g^  and  B^r  ^ 

all  the  remaining  terms.  We  continue  in  this  way.  At  the  k- 

, , _(r+k)  , (r+k)  _(r+k)  . , (r+k) 

th  stage,  we  have  ^ - h^  + \ • wlt^  h^  all 

Cr ) (r  +k ) 

the  terms  involving  g and  the  rest  of  the  terms. 

We  stop  at  k-r-1  and  obtain  the  decomposition  h » h +*'*+hr, 

where  h is  defined  to  be  R +c,  with  c the  mean  value  of  h. 
r r-1 

It  is  easy  to  verify  that  each  h^,  k*0,...,r  satisfies 

Lemma  7.1  with  u -r-k  and  v -r+k  and  A a constant  depending 

4r 

only  on  r,  e.g.  A=(4r)  surely  suffices.  Thus,  from  Lemma  7.1 
we  find 

r 

! h (0 ) — L (h,0)|  < CA  ][  ('sin0|k  n ^+n  ^r)  < C(An(cos0))r 
n k«o 

for  — tt< 0 <tt  . Since  C depends  only  on  r,  the  lemma  is  proved. 


THEOREM  7.1.  let  L^  b£  ££  i_n  (7.4).  Define  £s  iri  (7.5). 

If  feC(I),  then  A (f)  _is  an  algebraic  polynomial  of  degree  < n 
such  that  for  any  n>r , 


|f  (x)-An(f  ,x)  ' < Cw^  (f  ,A^(x))  , -1<x<1, 


with  C £ constant  depending  only  on  r . 

Proof.  Consider  first  a function  geW  (I).  The  function  g, 
r ,°°  l 

■ g-Pr(g)  satisfies  the  hypotheses  of  Lemma  7.2,  with  M - 

2r  ||g(r)!l  . This  is  shown  by  using  the  fact  that  g.fr^  “ g^ 

( X ) ^ ^ 

and  g^  has  at  least  one  zero  in  I,  for  each  i*0,.,.,r-l 

because  of  Rolle’s  theorem.  Hence, 


< 


,r-i 


Now  making  the  substitution  x = cos  0 and  using  Lemma  7.2 
we  have 


(7.7)  I g (x)-A  (g,x)  | < C |!g(r)l|  (A  (x))r, 

n 00  n 


-1<X<1 , 


with  C depending  only  on  r.  This  is 
ions  geW^  ^(1).  Since  the  operators 


our  estimate  for  funct- 

A are  uniformly  bound- 
n 
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ed,  the  theorem  follows  from  Theorem  2.4. 

Let  us  now  turn  to  another  intriguing  question  concerning 

pointwise  approximation  by  algebraic  polynomials.  That  is, 

-2 

whether  it  is  possible  to  drop  the  n term  that  appears  in  the 

definition  of  A (x)  and  still  have  an  estimate  like  that  in 
n 

Theorem  7.1.  This  question  was  posed  be  G.G.  Lorentz  at  the 

Oberwolfach  conference  on  approximation  theory  and  also  by  S.B. 

Steckin  in  the  Soviet  Ur.ion.  Subsequently  S.  Teljakovskii  [3lJ 

showed  this  was  true  for  r=l  and  I.  Gopengauz [ 20]  gave  this 

for  r«2.  We  want  to  show  how  the  operators  A given  above 

n 

can  be  used  to  give  the  case  r-2,  and  this  of  course  contains 
the  case  r«l. 

Starting  with  the  operators  A^  of  Theorem  7.1  with  r“2, 
if  feC(I),  define 

(7.8)  \i  (f,x)  «=  A (f,x)  + i.  (f,x), 

n n n 

where  l (f,x)  *=  -£(x-l)  (f  (-l)-A  (f,-l))  + £(x+l)  (f  (1)  — A (f,l)). 
n n n 

9.  (f)  is  the  linear  function  which  interpolates  f-A  (f)  at 
n n 

the  end  points  -1  and  1.  In  this  way  p^(f)  is  an  algebraic  pol- 
ynomial of  degree  < n which  interpolates  f at  -1  and  1. 

In  order  to  establish  the  finer  pointwise  estimates,  it  is 
necessary  to  see  how  well  (A  (f ) ) * approximates  f'. 

LEMMA  7.3.  Let  A^  b£  the  operators  of  Theorem  7 . 1 with  r=2 . 

If  Q -A  (g) , with  geC(I),  ! g" | < M,  a.e.  on  I.  Then, 

n n - — 

(7.9)  |g'(x)-QVx)|  < CM(An(x)),  -1<x<1 , 
with  C an  absolute  constant . 

Proof.  This  proof  is  similar  to  that  of  Lemma  7.2  (see  [l5]). 
Using  arguments  similar  to  that  in  Lemma  7.2,  one  proves  that 
for  the  trigonometric  polynomial  T (9)  = Qn(cos  6)  and  h(9) 

■ g(cos  9),  we  have 

(7.10)  |h(l)  (9)-T(l)  (9)  I < CMn1  (A  (cos9))2,  i-0,1,2,  -*<9<tt. 

' n - n - - 
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This  estimate  can  be  improved  when  i-1  and  9 is  near  0 because 

then  h-T  is  an  even  function  and  hence  h'(0)-T'(0)  = 0.  When 

• sin  £|<n  , we  have  from  (7.10)  that  I h,f(0-T"(£)  I < CMn  . 

n — 

Hence,  using  the  mean  value  theorem,  we  have  for  9e  f — tt / 2 , tt / 2 T 
and  I sin  0 | < n \ that 

(7.11)  |h*  (e)-T'  (0)  | < lh"(C)-T"(0  | l6  ! < CMn""2  I sin0  ! . 

n - n - 

This  inequality  also  holds  when  0 ^ [ — tt / 2 , tt / 2 ] provided  that 

| sin  0|<  n \ because  h'  (iTr)-T'  (Irr)  = 0. 

“ n 

If  we  superimpose  (7.11)  and  (7.10)  for  i-1,  we  have 

I h ' (0)-T*  (0)  | < CM  ■ sin©  I A (cos  0),  n<0<Tr. 

n - n - - 

This  last  inequality  put  in  terms  of  g'  and  Q'  gives  (7.9). 

n 

THEOREM  7.2.  Let  be^  as  in  Theorem  7.1  for  r=2.  Define 

jis  in  (7.8)  . Then  for  each  f eC (I) , we  have  (f ) is  an  al- 
gebraic  polynomial  of  degree  < n for  which 

(7.12)  !f(x)-p  (f,x)’  < Cm  (f,n  ^l-x2)^),  -1<x<1,  n>2, 

with  C an  absolute  constant . 

Proof.  Let  geW  (I)  with  !lg"ll  = M.  For  such  g,  we  have 

,00  00  — lx 

from  Theorem  7.1  that  I g (^ 1 ) — A (g,+l)|  < CMn  and  hence 
“A  ^ 

|Un(g)||oo  < CMn  . From  Theorem  7.1,  we  have 

(7.13)  | g(x)-y  (g,x)  I < lg(x)-A  (g,x)'  + I'  l (g)'| 

n n n 00 

< CM( (A  (x))2  + n"4)  < CM (A  (x))2,  -1<x<1. 

- n - n - - 

This  inequality  can  be  improved  near  the  end  points  by  using 

the  fact  that  ^(g.ll)  - g(il).  For  example,  when  0<x<  1 , 

(7.14)  |g(x)-pn(g,x) | < | x-1 | lg'(C)-(un(g))’(C)| 

1 I x-1 1 (|g,(0-(An(g))'(0!  + CMn-4) 

< CM | x-1 1 (A  (O  + n~4)  < CM(1-x2)(A  (x)), 

— n — n 

where  in  the  first  inequality  we  used  the  mean  value  theorem 

with  x<£<1,  in  the  second  inequality  we  used  the  fact  that 
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|l  (1  (g))’^  < CMn  ^ , in  the  third  inequality  we  used  Lemma  7.3 

n.  2 
and  in  the  last  inequality  we  used  the  fact  that  1-x  < 1-x  , 

for  0<x<l,  and  A (£)  < A (x) , because  0<x<£.  The  same  inequal- 
- - n - n ~ 

ity  as  (7.14)  holds  when  -l^x<0. 

When  we  superimpose  the  two  inequalities  (7.13)  and  (7.14) 
we  have 

(7.15)  1 g (x)-y^ (g ,x) | < CM(l~x2)n  2,  -1<x<1 , 

with  C an  absolute  constant.  This  is  our  inequality  for  func- 
tions in  W0  (I).  The  theorem  now  follows  from  Theorem  2.4 
2, 00 

since  the  operators  are  uniformly  bounded. 

8 Monotone  approximation 

Recently,  there  has  been  more  attention  given  to  approx- 
imation with  constraints.  The  appearance  of  a constraint  can 
make  it  more  difficult  to  obtain  Jackson  type  estimates.  Con- 
straints with  a fixed,  finite  number  of  functional  equations  or 
inequalities  usually  pose  no  difficulties  once  the  degree  is 
large  enough.  However,  when  the  constraint  involves  an  infin- 
ite number  of  inequalities,  that  are  perhaps  incompatible  with 
the  norm,  then  the  correct  direct  estimates  may  be  quite  for- 
midable to  obtain.  Still,  the  general  lines  of  attack  devel- 
oped for  the  non-constrained  problem  can  be  very  useful.  We 
want  to  indicate  what  modifications  are  necessary  to  push  this 
approach  through.  We  do  this  for  monotone  approximation,  which 
has  been  a prototype  for  constrained  problems. 

In  monotone  approximation,  we  are  given  a monotone  nondec- 
reasing function  f (ft)  on  I and  we  want  to  approximate  f in 
turn  by  polynomials  or  splines  that  are  likewise  nondecreasing. 
The  question  then  is:  does  the  constraint  cost  us  anything  or 
can  we  achieve  the  same  degree  of  approximation  as  in  the  non- 
constrained  case? 

It  is  again  meaningful  to  break  up  the  problem  into  two 
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subproblems:  i)  approximate  f by  a smooth  function  g,  ii)  app- 
roximate g by  monotone  polynomials  or  splines  as  the  case  may 
be.  Now,  it  is  clear  that  in  i),  we  should  not  approximate  by 
arbitrary  g's  but  we  should  try  to  retain  the  monotonicity. 
This  leads  then  to  the  following  definition  of  a constrained 
K-functional . 


(8.1)  Kr(f,t) 


inf  { ||f-g!l  + t " g fr)|1  J 


geW  (I),  g+ 


r 


In  Section  2,  we  were  able  to  show  that  K (f,t  ) and 

r 

uj r ( f , t ) are  asymptotically  equivalent  for  each  feC(I).  But 
the  constructions  given  there  do  not  preserve  monotonicity. 
Therefore,  we  immediately  have  the  question  of  how  K+  compares 


to  0) 


For  r-1,  it  is  easy  to  show  that  K^Cf.t)  and  w^(f,t) 


are  equivalent  when  ft  by  using  spline  approximation. 


Let  n “ {xfn^}^n  with  xfn^  - -1+in  * i-0, 1 , . . . ,2n . 

n 1 i ‘ o i 

When  r-1,  the  piecewise  linear  spline  S e^„(lT  ) which  interpol- 
r n v2  n 

ates  f at  the  knots  will  be  monotone  nondecreasing  and  satisfy 

(8.2)  ||f-Sn(|oj<  (^(f.n"1),  n-1,2,... 

-1, 


-1 


(8.3)  I'S^IL  1 ni^ff.n'1),  n-1,2, 


If  t>0,  then  take  n so  that  (n+1)  ^ < t < n V 


Then , 


(8.4)  K (f,t) 
r 


f-S 


+ t 


's'. 

n - 


-1. 


< 2uj  ^ (f  ,n  ) < 4u^  (f , t)  . 


We  already  know  the  reverse  inequality  because  of  Theorem  2.1. 

For  r=2,  we  can  use  the  variation  diminishing  splines  of 

I.  Schoenberg.  Let  H be  as  above  and  let  V (f)  be  the  var- 

n n,m 

iation  diminishing  spline  of  order  m with  knots  (see  .26 

for  the  definition  and  properties  of  variation  diminishing 

splines).  If  f+,thenV  (f)  is  also  nondecreasing.  Also,  it 

n ,m 

is  know  (see  e.g.  [ll,Ch.2]),  that  when  geW  (I),  then 

9 °° 

(8.5)  l!8'Vn,m(8)|!»  - Cn'2  M-* 

with  C a constant  depending  only  on  m.  Hence,  from  Theorem  2.3 
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(8.6)  I!  f-Vn  jn\\a  i Cw2 (f  ,n_1)  , n-1,2 


we  have  that  for  an  arbitrary  feC(I), 

-1 

n,m 

with  C again  depending  only  on  m. 

For  our  purposes,  we  need  only  take  m=3  and  let  S -V  „(f). 

n n,3 

Then,  it  is  easy  to  estimate  Sjj.  On  each  interval 
(xfn),xfnh,  0<i<2n,  S"  = a.  is  a constant.  When  0<2t<n  ^ , 


,X  . , ; 

1 * 1+1; 


lot. ! - It  2A^(Sn,x^n;)|  < t'^{|A^(f-Sn,x^n))l  + 

|A2(f,x{n))|}  < Ct"2  {o)2(f,n_1)  + o)2(f,t)}, 

where  we  used  (8.6).  Now,  take  t-l/3n  to  find 


(n). 


-2, 


(n), 


(8.7)  |1S^| 


2 1 

max  jaj  < Cn  u (f,n  ),  n=l,2 

i 


with  C an  absolute  constant.  When  (8.6)  and  (8.7)  are  used  as 
in  the  proof  of  (8.4),  we  get 

(8.8)  K2(f,t2)  < Cw2(f,t),  t>0, 

with  C an  absolute  constant.  The  converse  of  (8.8)  is  given  in 
Theorem  2.1. 

It  is  still  not  known  whether  (8.4)  and  (8.8)  extend  to 
arbitrary  r.  However,  a weaker  estimate  for  general  r can  be 
obtained  using  the  following  theorem  on  spline  approximation. 

THEOREM  8.1.  Let  II  = {xfn)}2n,  x^n)  = -1+in"1,  0<i<2n.  If 
n 1 l J o l - - — 

feC(I),  f+,  and  r=l  or  2,  then  there  are  splines  S ej  (H  ), 

n r n 

with  S +,  and 
n 

(8.9)  Hf-Sjl^  < Cur (f ,n_1) , n-1,2 

(k) 

with  C an  absolute  constant . For  r>2,  and  f eC(I) , k<r,  ft 

there  are  splines  S tS  (n  ),  with  S +,  and 
n r n n 

(8.10)  Hf-Sjl^  < Cn^ai^fJ^.n'1),  n-1,2 

with  C £ constant  depending  only  on  r. 

Proof.  The  proof  of  the  existence  of  splines  which  satisfy 
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(8.9)  was  already  given  above.  The  proof  of  (8.10)  is  much 
more  difficult  and  is  given  in  [l3~l.  The  construction  of  these 
splines  is  non-linear  and  highly  local.  We  dont  want  to  give 
the  details  but  nevertheless,  we  can  indicate  briefly  why  such 
a construction  is  plausible. 

Let  en  « n"ko)1(f ,n-1) . If  f(k)eC(I),  with  k>2  (we  al- 
ready know  the  cases  k=0,l  from  (8.6)),  then  the  spline  S = 

n 

^ given  in  (4.10)  will  satisfy 

(8.11)  |!f(l)-S(l)||  < Cn1^  , i-0,1. 

n 00  - n 

This  estimate  for  i=0  is  already  proved  in  Theorem  4.2  and  the 

estimate  for  i«l  is  proved  similarily  (see  [4]).  Hence,  if  ft 

and  f'(x)>Cne  , xel,  then  by  virtue  of  (8.11)  with  i=l,  the 
- n 

spline  S will  have  S'(x)>0,  xel,  and  so  S t,  as  desired, 
n n - n 

On  the  other  hand,  when  f'(x)<Cne  ,x  I,  then  as  shown  in 

- n 

(8.6),  the  spline  S ■ V (f)  satisfies 

n n , r 

Ilf-sJL  i c"-1  ML  ; cen. 

We  know  that  S^t  and  so  again  we  have  the  desired  result. 

Therefore,  we  know  the  validity  of  (8.10)  when  f'  is  al- 
ways bigger  than  Cne  on  I and  also  when  f'  is  always  smaller 

n 

than  Cne  on  I.  What  is  needed  then  is  to  blend  these  two  re- 
n 

suits.  This  is  done  by  breaking  up  the  interval  I into  inter- 
vals where  f'  is  "large"  and  "small"  respectively  and  then  con- 
struct appropriate  spline  approximation  for  the  separate  cases. 

t . 

Theorem  8.1  allows  us  to  give  an  estimate  for  in  terms 
of  smoothness. 

THEOREM  8.2.  _If  feC(I)  and  ft,  then  for  r-1,2, 

(8.12)  K*(f,tr)  < Ca>r(f,t),  t>0, 

(k) 

with  C jin  absolute  constant . For  r>2,  0<k<r,  f eC(I) , and  ft 

(8.13)  K^(f,tr)  < CtkM1(f,t),  t>0, 
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with  C depending  only  on  r. 

Proof . We  have  already  shown  (8.12).  To  prove  (8.13),  we  use 

the  splines  of  Theorem  8.1.  For  r>2,  according  to  Theorem  8.1, 

there  is  a spline  S eS“  , _ (TE  ) such  that  S + and 
r n r+1  n n 


(8.14)  II  f-S 


< Cn“ko)1(f(k)  ,n_1). 


Similar  to  the  way  we  estimated  the  derivative  of  V -(f)  in 

n j 3 

(8.7),  it  can  be  shown  (see  [l4]  for  details)  that 


(8.15) 


- r r-k  /r(k)  "U 
< Cn  oj^  (f  ,n  ) , 


with  C depending  only  on  r.  For  0<t,  we  choose  n so  that 
(n+1)  ^"<t<n  ^ and  then  use  in  the  same  way  we  estimated  in 
(8.4)  to  obtain  (8.13). 

Monotone  approximation  by  algebraic  polynomials  parallels 
to  a large  extent  that  of  spline  approximation  in  that  it  is 
relatively  easy  to  obtain  the  Jackson  estimates  in  terms  of 
and  ^2 > but  beyond  that  things  become  much  more  difficult.  Of 
course,  now  we  have  the  advantage  of  using  Theorem  8.2,  which 
was  proved  using  spline  approximation. 

The  lower  order  estimates  for  monotone  polynomial  approx- 
imation follow  from  the  constructions  of  G.  G.  Lorentz  - K. 
Zeller  [24],  r=l,  and  G.  G.  Lorentz  C23]  , r*2.  The  Lorentz- 
Zeller  proof  is  to  go  to  the  trigonometric  case  via  the  sub- 
stitution x=  cos  9.  The  function  h(9)  = f(cos  9)  is  now  a 
bell  shaped  function  which  in  turn  must  be  approximated  by  bell 
shaped  trigonometric  polynomials  of  degree  n.  This  is  done  by 
approximating  h first  by  an  interpolating  spline  of  order  r 
with  2n  equally  spaced  knots  and  then  using  the  Jackson  oper- 
ators of  Section  5 to  approximate  the  spline  by  a trigonometric 
polynomial . 

It  is  also  possible  to  argue  directly  in  the  same  spirit 
as  Section  6.  Consider  the  case  r=2  which  includes  the  case 
r*l,  since  (^(f.t)  < 2w^(f,t),  t>0.  Suppose  that  geW2  ^(1), 
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' ■ g"  1 1 oo  = M and  g+  • First  extend  g outside  of  I by  setting  g(x) 

= g(l)+g'(  1) (x-1)  , x>l  and  g(x)  » g (-1) +g ' (-1) (x+1) , x<-l.  The 
extended  g is  still  nondecreasing  and  has  I!  g"||oo[-2,2  ] = M.  The 
fact  that  we  can  extend  g monotonically  without  losing  smooth- 
ness is  one  of  the  basic  steps  that  can  not  be  done  in  general. 

We  can  also  make  a linear  adjustment  without  changing  the 
monotonicity  of  g and  thereby  assume  that  g(-2)=0  and  g’(a)=0, 

for  some  -2<a<2.  Analogous  to  Section  6,  we  set 

2 

(8.16)  Ln(g,x)  = / g(t)un(x-t)dt, 

with  for  each  n,  a non-negative,  even  algebraic  polynomial 
of  degree  n with  integral  1 over  f-2,2^  and 
3 2 -2 

(8.  / t u (t)dt  < Cn  , n=l ,2 , . . . , 

-3  n 

with  C an  absolute  constant.  We  also  assume  that 

CS.  17)  IIpJLF-3,-1]  1 c^'2. 

The  polynomials  of  Section  6 satisfy  all  the  above. 

As  we  argued  in  Section  6, (see  [llj),  one  can  show  that 

l1g-Ln(g)IL  1 CM""2. 

with  C an  absolute  constant.  The  polynomials  L^(g)  are  not 

necessarily  nondecreasing,  but  we  have  for  xel, 

2 

(Ln^g))'(X)  = ^ g' (t)un(x-t)  + g(-2)pn(x+2)  - g(2)un(x-2) 

> -g(2)C1n'2 

because  g'  and  u are  non-negative,  g(-2)=0  and  (8.17). 

n . -2 

Thus,  the  polynomials  P (x)  = L^(g,x)  + C^g(2)n  x will 
be  nondecreasing  and  satisfy 

(8.18)  Ilg-PjLl  !|g-Ln(g)IL+  V2  llgl!^  CMn'2, 

with  C an  absolute  constant.  Fere,  we  used  the  fact  that 
l|g||oo<  16  (I  g”l! which  follows  because  both  g and  g'  have  a 
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zero  in  [-2,  2].  This  is  our  result  for  smooth  functions  and 
leads  to  the  following  theorem. 


THEOREM  8.3.  _If  feC(I),  ft,  and  r-1  or  2,  then  for  each  n>l, 
there  is  a_  polynomial  of_  degree  < n,  such  that  P t and 

(8.19)  (|  f-P  ||  < Cw  (f  ,n_1)  , 

n — £ 

(k) 

with  C an  absolute  constant . For  r>2,  f eC (I)  , 0<k<r , ft , 

we  have  that  for  each  n>l , there  is  a_  polynomial  P^  o£  degree 

< n,  such  that  P t and 
n 

(8.20)  Ilf-Pjl^  Cn"km1(f(k),n"1), 
with  C a constant  depending  only  on  r. 

Proof . For  r=2  and  feC(I),  take  gt  so  that  H f-gjl^+n  !lg"lloo< 
^).  The  existence  of  g is  guaranteed  by  Theorem  8.2. 
Let  P^t  be  the  polynomial  which  approximates  g according  to 
(8.18).  Then, 


<_  Cn  ‘ Mg"! 

< Cw^f.n'1), 


with  C an  absolute  constant.  This  is  (8.19)  for  r=2  which  also 
also  includes  the  case  r=l. 

For  r>2,  the  story  is  much  the  same  as  for  splines.  The 
proof  is  detailed  and  is  given  in  [l 4 ] . While  the  matter  is  a 
little  simplified  because  of  Theorem  8.2,  we  no  longer  have 
the  nice  B-splines  with  which  to  make  local  corrections. 

This  requires  the  construction  of  polynomials  which  mimic  the 
B-splines  in  that  they  are  large  on  a given  interval  and  fall 
off  fast  outside  of  this  interval. 
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LAGRANGE  INTERPOLATION  AT  ZEROS 
OF  ORTHOGONAL  POLYNOMIALS 
G.  P.  Nevai 

The  purpose  of  this  paper  is  to  give  a detailed  survey  of 
new  results  in  the  theory  of  convergence  of  Lagrange  interpola- 
tion taken  at  zeros  of  orthogonal  polynomials  with  special  em- 
phasis on  results  related  to  infinite  intervals.  Many  open 
problems  are  indicated. 

When  Professor  G.  G.  Lorentz  asked  me  to  write  a short 
survey  I decided  to  choose  the  title  "Interpolation  at  zeros 
of  orthogonal  polynomials  and  related  problems."  What  should 
a survey  with  such  a title  have  contained?  It  should  certainly 
have  dealt  with  problems  of  convergence  and  divergence  of  La- 
grange, Hermite-Fe  jer , lacunary  and  trigonometric  interpolation, 
as  well  as  with  those  of  mechanical  quadrature  processes.  It 
should  have  discussed  some  aspects  of  the  general  theory  of 
orthogonal  polynomials,  in  particular  asymptotic  relations  and 
bounds  for  orthogonal  polynomials  and  the  distribution  of  zeros 
of  orthogonal  polynomials.  Questions  of  weighted  uniform  and 
L*3  approximation  of  functions  by  polynomials  should  have  been 
mentioned  also.  Furthermore,  I should  have  considered  problems 
of  stability  of  interpolation  and  that  of  "rough  and  fine"  in- 
terpolation theory.  The  use  of  interpolation  in  solving  other 
problems  should  also  have  been  discussed.  In  short,  I soon 
realized  that  such  an  article  would  be  impossibly  long.  Thus 
I decided  to  restrict  my  attention  to  convergence  and  diver- 
gence of  Lagrange  interpolation,  this  being  the  topic  with 
which  I am  most  familiar. 

This  survey  will  be  in  no  sense  complete.  For  instance, 
neither  Fejer's  nor  Szego's  very  important  results  will  be 

Preceding  page  blank 
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mentioned.  I will  concentrate  only  on  recent  results,  but  even 
so,  the  paper  will  be  far  from  exhaustive.  The  material  con- 
sidered is  divided  into  two  parts;  the  first  deals  with  point- 
wise  convergence  and  the  second  with  mean  convergence. 

Let  a be  a bounded,  non-decreasing  function  defined  on  ]R  . 
We  say  that  a is  a weight  function  if  it  has  infinitel;-'  many 
points  of  increase  and  all  the  moments  of  cx  are  finite.  If 
0!  is  absolutely  continuous  we  write  w = a1  . For  a given 
weight  CX  there  exists  a unique  sequence  of  polynomials 

p (da,x)  = y (da)xn  + ...  (n  = 0,1,...)  having  the  properties 
n n 

Y (da)  >0  and 
n 

00 

/ p (da,x)p  (da,x)da(x)  = 5 
J n m am 

-oo 

The  zeros  of  p^(da)  are  real  and  distinct;  we  denote  them  by 

x,  (da):  X1r,(da)  > x„  (da)  > ...  > X (da).  For  a given  func- 
kn  in  2n  nn 

tion  f the  Lagrange  interpolation  polynomial  L (da,f)  cor- 

n 

responding  to  the  weight  Cx  is  defined  to  be  the  unique  alge- 
braic polynomial  of  degree  at  most  n-1  which  coincides  with 
f at  the  nodes  x^(da)  (k  = 1,2,..., n).  Thus 

n 

L (da,f,x)  = / f(x  (da))i,  (da,x) 

n . . kn  kn 

k=  1 

where  i^(da)  are  the  fundamental  polynomials.  An  important 

quantity  we  shall  need  later  is  the  Christoffel  function 

\ (da)  corresponding  to  the  weight  a.  It  may  be  defined  in 
n 

several  ways;  for  instance, 

m 2 

k (da,x)  = min  / 7 r (t)da(t) 

7T£  IP  . -00 

2 n_1 

rr  (x)=l 

where  IP^  denotes  the  set  of  polynomials  of  degree  at  most  n. 
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By  v/a,b-*we  shall  denote  the  Jacobi  weight,  that  is, 
w(a,b)(x)  = (l-x)a(l+x)b  for  | x | < 1 and  w^a,b^(x)  = 0; 
otherwise,  (a,b  > -1). 

J . Pointwise  convergence 

1.  We  begin  with  the  classical  and  most  thoroughly  in- 
vestigated part  of  the  theory  of  Lagrange  interpolation,  name- 
ly, pointwise  convergence  of  L^Cda.f)  when  the  support  of 
da  is  compact.  Recent  investigations  here  may  be  classified 
as  follows! 

(i)  Consideration  of  L^da.f)  when  a is  not  neces- 
sarily absolutely  continuous. 

(ii)  Finding  estimates  which  hold  uniformly  on  the  whole 

support  of  da  for  the  deviation  of  L (da,f)  from  f and 

n 

similarly  for  the  corresponding  Lebesgue  function. 

(iii)  Improvement  of  convergence  either  by  adding  a few 
additional  points  to  the  set  of  nodes  or  by  using  certain 
other  tricks. 

(iv)  Determination  of  conditions  assuring  divergence. 
Because  of  the  huge  number  of  results,  we  will  illustrate 

the  above  classification  with  only  a few  selected  theorems. 
Concerning  (i),  G.  Freud  [28]  and  Ja.  L.  Geronimus  [49]  have 
obtained  many  interesting  theorems,  among  which  the  following 
seems  to  be  the  strongest. 

THEOREM  1.  Let  supp(da)  be  compact  and  let  |p  (dO£,x)|  be 
uniformly  bounded  for  x e m ^ supp(da).  Let  f be  a func- 
tion vanishing  in  a neighbourhood  of  the  interval  A with 

00  2 

(1)  / |f(t)|da(t)<'», 

-OO 

where  the  integral  is  understood  in  Riemann-Stieltjes  sense. 
Then  Ln(da,f,x)  converges  to  0 uniformly  in  x e m 0 a 
when  n — > oo. 
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Theorem  1 combined  with  the  classical  machinery  can  be  used 
to  prove  a lot  of  convergence  theorems  at  individual  points. 

One  of  them  is  the  following 


THEOREM  2.  Let  x^  be  fixed  and  let  f satis fy  (1).  Let  a 
be  absolutely  continuous  in  a neighbourhood  v(x^)  £f  x^.  I_f 
a'  (x)  < M < <»  and  | pn(da,x) | <M<»  (n  = 1,2  , . . . ) for 
x e V(xQ)  and  f is  of  bounded  variation  in  V(Xp)  then 

lim  L (dOt,f,x  ) = f(x  ) 
n u u 

n— >w 

provided  that  f is  continuous  at  x^. 

For  further  results  we  refer  the  reader  to  [25],  [49], 
[59],  [60],  [79]  and  [81], 


The  important  role  of  Lebesgue  functions  is  well  known. 
Nevertheless,  for  general  weights  only  fairly  weak  estimates 
have  been  obtained.  We  know  that  if  supp(da)  c [-1,1]  and  a' 
is  bigger  than  the  Chebyshev  weight,  then  the  corresponding 
Lebesgue  function  is  of  order  /n*  uniformly  in  [-1,1].  If  a' 
is  strictly  positive  on  [-1,1],  then  the  Lebesgue  function  is 
of  order  on  every  interval  Ac  (-1,1),  and  it  is  of  order 

n on  [-1,1].  These  are  the  strongest  estimates  known.  If  we 
impose  conditions  also  on  |p^(da)]  we  can  obtain  better  esti- 
mates as  it  was  done  by  G.  Freud  [25]  (see  also  [49]).  It 
should  be  possible  to  determine  the  exact  order  of  Lebesgue 
functions  of  Lagrange  interpolation  corresponding  to  weights  of 
the  form  w = (Jw‘  where  0 satisfies  some  growth  and 

smoothness  conditions.  For  $ = 1,  G.  I.  Natanson  [70]  proved 
the  next 


THEOREM  3. 


Let  a,b  > - — and  let  1 (a,b,x)  denote  the 
z n 


Lebesgue  function  of  Lagrange  interpolation  taken  at  the  zeros  of 
pn(w^a,k^).  Then  |f  (a,b,x)  - l|  is  exactly  of  order 

I nI,i  . „(a/2 + l/4,b/2  + 1/4),  . 

! Pn(w  ,x)|(l  + w'  '(xjlog  n) 
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when  n — > oo. 

We  hope  to  soon  obtain  similar  estimates  when  $ > 0 and 
belongs  to  lip(3.  See  also  [54],  [56],  and  [94]. 

As  mentioned  above,  if  supp(da)  a [-1,1]  and  a'  (x)  > 
m > 0 for  almost  every  x e [-1,1],  then  it  follows  that  for 
every  f e lip  Ln(da,f,x)  converges  to  f(x)  uniformly  on 
[a,b]  c (-1,1).  G.  Freud  [30]  has  asked  why  this  convergence 
is  not  uniform  on  the  whole  interval  [-1,1].  One  possible  ex- 
planation is  that  supp(da)  c [-1,1]  implies  that  |x  (da)  | < 1 

Kn 

(and  therefore  Ln(da,f,±l)  4 f(±l))  and  |p^(da,±l)|  grows 
so  rapidly  that  Ln(da,f,±l)  cannot  converge  to  f(±l).  For 
this  reason  G.  Freud  suggested  adding  two  new  points  to  the  set 
of  nodes,  namely  ±1.  Let  us  denote  by  Ln(da,f)  the  Lagrange 
interpolation  polynomials  taken  at  the  nodes  {x^^da) ) U [±1 ) . 

/ 2* 

THEOREM  4.  Let  supp(da)c  [-1,1]  and  a' (x)>^/l-x  for 
almost  all  x e (-1,1).  Suppose  also  that 

/ satLL.  < 

I f f e lip  j £n  [-1,1]  then  L*(da,f)  converges  to  f uni- 
formly on  [-1,1]  when  n — > oo. 

For  further  development  of  this  idea  see  [30],  [31]. 
Concerning  estimations  from  below  and  divergence  of  La- 
grange interpolation  we  mention  two  results  which  seem  to  be 
typical . 


THEOREM  5 . Let  v denote  the  Chebyshev  weight.  If  o>  is  a 
modulus  of  continuity  with 


lim 
t— »+0 


^ ( t ) 

t | log  t 


0 


then  there  exists  a continuous  function  f on  [-1,1]  with 
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oj(f,5)  < o>(6)  and 

| L (v,f ,l)-f (1) | 

lim  sup  — 11 77-7 — r > 0. 

ou(l/n) 

n — > 00 

THEOREM  6.  Let  supp  (da)  a [-1,1]  and  let  £ (da)  denote 
the  corresponding  Lebesgue  constants.  Let  a be  a modulus  of 
continuity.  If 

00  > A = lim  sup  oj  { — — j £ (da)  > 0 

n — > 00  ' n £ (da)  • n 

n 

then  there  exists  a continuous  f with  o>(f,5)  < Cuj(5)  and 

lim  sup  max  |f(x)  - L (da,f,x)|  = A. 

n ->oo  |x  | < 1 ° 

For  other  results  we  refer  the  reader  to  works  of  0.  Kis, 
J.  Szabados,  P.  Vertesi,  D.  L.  Berman.  The  above  theorems  were 
proved  in  [54]  and  [58]  , respectively. 

Although  much  more  could  be  said  about  the  case  when 
supp(da)  is  compact,  we  stop  here,  for  there  are  other  things 
we  would  like  to  mention.  Less  is  known  about  the  behavior  of 
Lagrange  interpolation  processes  corresponding  to  weights  with 
non-compact  support.  What  is  known  will  be  discussed  in  the 
next  three  sections. 

2.  Here  we  will  give  two  theorems.  They  are  the  only 
known  convergence  theorems  when  supp (da)  is  non-compact  and 
a is  not  one  of  the  classical  weights.  V'e  would  like  to  hope 
that  the  reader,  realizing  how  weak  the  statements  below  are, 
will  try  to  find  better  results.  The  significance  of  the  next 
theorem  of  G.  Freud  [27]  is  that  it  was  the  first  successful 
attempt  to  show  convergence  of  Lagrange  interpolation  on  an 
infinite  interval. 

THEOREM  7.  Let  f e lip  j-  on  ]R  . _l_f  w satisfies  the 
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conditions 


w(x)  > 


c^exp(-ax' 


), 


f expCc^x  )w(x)dx  < oo 

“CO 


(c^  ,c2  ,a  > 0)  then 

lim  sup  exp  (- x^  ) I f (x)  - L (w,f,x)|  = 0. 
m l ' n 

n-H>oo  xeK 

This  theorem  can  be  easily  generalized  to  weights  w with 

a 00  o 

w(x)  > c ^ exp(-a|x|  ),  / exp(c2  |x  |p)w(x)dx  < oo 

-CO 

for  |3  = 4,  6,  8 ...  but  it  would  be  really  interesting  to 

know  what  happens  if  (3  e 1R+  . If  instead  of  uniform  conver- 

gence we  are  interested  only  in  convergence  almost  everywhere, 
then  the  class  of  weights  in  consideration  can  be  extended. 

The  following  theorem  has  been  proved  in  [72]. 

THEOREM  8.  Let  supp(da)  be  non-compact.  Suppose  that  there 
exists  a function  g which  satisfies  the  following  two  condi- 
tions . 

-1  2 

( i ) g e and  the  Gauss-Jacobi  mechanical  quadra- 
ture process  taken  at  the  zeros  of  p (da)  is  convergent  for 

|g|'2. 

( i i ) For  every  non-negative  integer  n 


|xng(x)  | < C-  (g  )n  (x  e 1R) 
n 


where  g > 1,  g n 10  when  n — » oo.  Suppose  also  that  f 

n - n — 1-1 

bounded  on  ]R  and  for  some  natural  integer  R the  modulus 
smoothness  of  R-th  order  of  f satisfies  the  condition 


z 


n=  1 


^D(f,g  n ) 


< CO. 


is 

of 
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Then  for  da  almost  every  x,  Ln(da,f,x)  converses  to  f(x) 
when  n -4  oo. 

This  theorem  can  be  applied  to  the  case  where  a'(x)  = 

|x  |Fexp(- |x  |^)  (e  > -1,  3 = 2,4,...).  In  this  case  we  can  put 
g(x)  = exp(-  ^|x|^)  and  then  = (4n)^^. 

We  conclude  this  section  with  a problem  which  is  closely 
related  to  Freud's  theorem.  Let  a'(x)  = exp(-|x|  ) (p  e ]R  , 
x e IR)  . What  can  be  said  about  the  magnitude  of  the  sum 

^.Akn(da)eXp(c|xkn|P) 

k=l 

(A^Cda)  = An(da,xkn(da)))?  We  suspect  that  it  is  bounded  for 

1 /ft 

c < 1 and  it  is  of  order  n for  c = 1.  The  case  c > 1 is 

also  interesting  (see  [73]). 

3.  Here  we  will  be  concerned  with  pointwise  convergence 

of  Lagrange  interpolation  taken  at  the  Hermite  abscissas,  that 

2 

is,  w(x)  = exp(-x  ),  x e IR  . Even  if  we  restrict  ourselves  to 
consider  the  simplest  estimates,  that  is,  estimates  like 

|f(x)  - Ln(w,f  ,x)  | < something-<jj(f , ?), 

there  are  two  problems  which  arise  immediately.  First,  there 
is  no  hope  of  finding  estimates  where  the  right  side  is  inde- 
pendent of  x e IR.  Secondly,  the  classical  methods  which  use 
best  approximating  polynomials  and  Lebesgue  function  estimates 
might  fail  since  we  have  functions  defined  on  the  whole  real 

line.  The  first  problem  can  be  eliminated  easily.  Since  in 

2 12 
our  case  w(x)  = exp(-x  ) one  may  expect  a factor  of  exp^x  ) 

to  appear  on  the  right  side.  The  second  problem  may  be  removed 

in  the  following  way.  We  know  that  Ln(w,f)  depends  only  on 

those  values  of  f which  it  takes  at  the  Hermite  abscissas  and 

the  biggest  zero  of  the  n-th  Hermite  polynomial  is  less  than 

J2  n+1' . So  let  us  apply  the  Jackson-Brudny  theorem  on  the 
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interval  (-  /2n+I\  /2n+l' ) and  see  what  happens.  The  best  re- 

sult one  can  hope  to  obtain  would  be 

12  / i 

I f (x ) - L (w,f,x)|  < C expOj-x  ) log  n uoRlf,  “ 

K V /n 

(here  oj  ( f , 5 ) denotes  the  R-th  modulus  of  smoothness  of  f). 

R 

This  was  the  estimate  which  G.  Freud  sought  to  prove  in  1968. 

To  his  (and  others')  surprise  he  managed  to  obtain  a much 
stronger  result: 

THEOREM  9.  Let  f be  uniformly  continuous  on  !R.  Then 
|f (x)-Ln(w,f,x)  | < cJ~log  n + exp(|-x2)J  o)R^f,  ^ j 


for  every  real  x and  natural  integer  R with  the  constant 
depending  only  on  R. 


Theorem  9 cannot  be  improved,  as  is  shown  by  the  next  two 
theorems  (see  [29],  [57]). 


THEOREM  10.  For  every  modulus  of  continuity  oj(6)  there  exist 
a uniformly  continuous  function  F,  an  increasing  sequence  of 
natural  integers  { n^_  j and  a sequence  (x^)  such  that 


,(F,5)  < cu(6) 


and 


lF(Xr>"Ln  (w»F’xr>  I > C [lo8  nr  + exP  x^ )J  ^ ( ~~~) 

r ' vn  ! 


for  r = 1,2,...  . 


Let  0 be  continuous,  nondecreasing  on  ]R  with  0(0) = 0, 

-R  -f- 

0(5)  >0  (5  > 0)  and  let  0(5)6  be  non-increasing  on  1R 

for  a fixed  natural  integer  R with 


lim  0(6)6'R  = +oo  . 
g^+0 
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THEOREM  11.  There  exist  a uniformly  continuous  function  G and 

spquences  I n } and  fx  ) such  that 

— j r r 

u.r(G,  5)  < <Z>(&) 

and 

|G(xr)  - Ln  (w,G,xr)|  > cjjLog  nr  + exp(jx2)j  d | 

r r 

for  r = 1,2  , . . . . 

When  G.  Freud  drew  my  attention  to  Lagrange  interpolation 
on  infinite  interval,  the  first  thing  I wanted  to  prove  was 
the  convergence  of  Ln(w,f)  when  f is  of  bounded  variation. 

I had  been  trying  to  apply  all  the  known  methods  to  prove  this 
but  none  of  them  worked.  Then  1 remembered  that  in  the  theory 
of  trigonometric  Fourier  series  the  simple  formula 

Sn(x,f)  - f(x)  = j[Sn(x,f)  - Sn(x  +^,  f)]  + 0[cu(f,  i)] 

had  helped  me  to  find  many  interesting  results  and  1 tried  to 
find  a similar  representation  for  the  deviation  of  Ln(w,f) 
from  f.  This  was  done  in  [79]'. 

THEOREM  12 . Let  f be  uniformly  continuous  on  H and  let 
e > 0 be  fixed.  Then 


sup  exp  ( - |-x2)  | L^  (w , f , x ) - f(x)  - \ Z _ [f(xkn>- 


xelR 


X"Xknl<€ 


f(xk-l,n)]WW’x)l  = 

for  n = 1,2,... 

Let  us  remark  that  I suspect  this  is  not  the  best  possible 
estimate;  the  right  side  should  perhaps  be 


•K‘-  y 


With 


the  aid  of  the  above  formula  I could  easily  prove  convergence 
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for  functions  having  bounded  variation.  But  this  formula  did 
more  than  simply  provide  a lemma  for  proving  convergence  for 
functions  expressible  as  the  difference  of  two  monotonic  func- 
tions. It  turned  out  that  it  implicitly  contained  many  con- 
vergence tests. 

Let  us  say  that  f satisfies  the  one-sided  Dini-Lipschitz 
condition  on  the  (finite  or  infinite)  interval  (a,b)  if 

f (x+5)  - f (x)  > - •|1Qg'^  |-  ' (a  < x < x + 6<  b) 

where  5 is  small,  v(S)  >0  is  nondecreasing  with  lim  v(5)=0. 

&— > +0 

It  is  obvious  that  every  function  of  bounded  variation  is  the 
difference  of  two  functions  satisfying  the  one-sided  Dini- 
Lipschitz  condition  (with  v = 0,  say). 

THEOREM  13.  Let  f be  uniformly  continuous  on  ]R  and  satisfy 
the  one-sided  Dini-Lipschitz  condition  on  the  (finite  or  in- 
finite) interval  (a,b).  Lf  0 < e < (b-a)/2  then 

1 2 

lim  sup  exp(-  j x ) | f (x)  - L (w,  f ,x)  | = 0. 

n-»o>  a+e  < x < b - e 

This  theorem  does  not  give  the  rate  of  convergence,  but 
the  next  convergence  test  will  have  this  feature.  Let  0 be 
defined  on  ]R^  and  let 

0(0)  = 0,  0(x)  < 0(y ) (x  < y). 

Following  L.  C.  Young  [98]  we  say  that  f has  bounded 
0 variation  on  the  (finite  or  infinite)  interval  (a,b)  if 

Vf)  = sup  E 0 ( | f (b  . ) - f(a.)|)  < 00 
w t i 1 1 

where  t is  an  arbitrary  finite  system  of  finite  disjoint  in- 
tervals (a^.b^Jc:  (a,b). 

THEOREM  14.  Let  f be  uniformly  continuous  on  ]R  and  be  of 
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bounded  variation  on  (a,b)  c ]R  . _If  log  0 e L (0,1)  and 

. b-a  , 

0 < e < —y—  then 

1 2 

sup  exp  (-  yx  ) I f (x ) - Ln(w,f,x)|  = 

a+e  < x < b - e 


= 0 


u)(f,l//tT) 

/ 

0 


I log 


0(t) 


If  0(0  = iP  (1  < p < ro),  that  is,  in  case  of  functions 
of  bounded  p variation  (see  [96]),  the  integral  on  the  right 
side  is  of  order 


108  “(f>  ^M£l  7s)- 


The  reader  might  notice  that  all  the  above  theorems  require 

uniform  continuity  on  the  whole  real  axis  of  functions  under 

consideration,  even  if  we  are  interested  in  convergence  only  at 

individual  points.  This  strong  requirement  can  be  dropped  by 

using  the  following  localization  theorem  proved  in  [80].  I 

would  like  to  emphasize  that  it  is  by  no  means  clear  that  this 

is  definitely  the  best  possible  version  of  the  theorem. 

Let  R denote  the  set  of  those  functions  f defined  on 

the  real  line  which  are  Riemann  integrable  on  every  finite  in- 

2 

terval  and  are  of  order  exp(ex  ) when  x ->  » where  e = e(f) 
is  non-negative  and  less  than  1/2.  Since  Riemann  integrable 
functions  are  always  bounded  we  may  suppose  that  for  every  f 
in  R 

| f (x)  | < A exp  (ex2)  (x  e ]R  , A = A(f),  e = e(f)  < y ). 

THEOREM  15.  If  f e R and  f = 0 on  [a-5,  b+6]  (6  > 0)  then 
1 2 

max  exp(-  yx  ) |l  (w,x,f)|  = o(l) 
a < x < b 
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when  n ->  oo  and  o(l)  depends  only  on  f and  8 but  not  on 
the  interval  (a,b). 


The  proof  of  Theorem  15  is  rather  complicated  and  is  based 
on  two  facts.  One  of  them  is  the  definition  of  the  Riemann 
integral.  The  other  is  the  inequality 


z 


|x--knl>8 


kn 


iexp<£xL)ik„("’x)  + ^ 


< c 


A 2^ 

expCj  x ) 


for  8 > 0,  0 < e < |x  | < /n  . 

Theorem  15  can  be  used  to  prove  localized  convergence 
theorems.  We  shall  mention  three  conditions  assuring  conver- 
gence of  L (w,f)  to  f and  a given  point  x.. 

n U 


THEOREM  16.  Let  f e R and  x^  be  fixed.  Let  f satisfy 
one  of  the  following  three  conditions! 

(i)  | f (xQ  + t)  - f(xQ)|  < p(t)  (0  < t < 8) 

where  p.  is  a nondecreasing  function  with 

6 

lim  J dt  < oo. 

p-»+0  p 

(ii)  f is  continuous  at  x^  and  satisfies  the  one-sided  Dini- 
Lipschitz  condition  in  a neighborhood  of  x^. 

(iii)  f is  continuous  at  x and  is  of  bounded  <b  variation 

^ 1 

in  a neighborhood  of  x^  with  0 e L (0,1).  Then 

lim  Ln(w,f,xQ)  = f(xQ). 

n— »oo 


4.  The  results  of  the  previous  section  could  surely  be 
obtained  also  for  Lagrange  interpolation  based  on  the  zeros  of 


4 
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generalized  Hermite  polynomials  (w(x)  = |x  Pexp  (-x^),  a>  -1, 

oc 

x e 1R  ) and  Laguerre  polynomials  (w(x)  = x exp(-x),  a > -1 , 
x e 1R+)  , respectively.  The  only  new  difficulties  arise  from 
the  fact  that  these  two  weights  have  singularities  at  zero. 

Some  of  the  corresponding  theorems  have  been  found  already,  but 
other  problems  (like  localization)  are  still  open.  The  next 
theorem  was  proved  by  0.  Kis  [57].  (For  other  related  results 
see  [71],  [77],  [79].)  Denoting  the  generalized  Hermite  weight 
by  w^  we  have: 

THEOREM  17.  Let  f be  uniformly  continuous  on  ]R  and  R be_ 
fixed.  Then 


|f (x)-Ln(wQ,f ,x)  | 
for  lx  | > — — and 

~ ft  — 


f(x)-Ln(wa,f,x)  | 


exp(ix2)].R(f,-i) 


for  -1  < a < 0 


for 

for 


a = o 

a > 0 


for  | x | < 1 / <y~n  . These  estimates  cannot  be  improved  in  the 
sense  of  Theorems  10  and  11. 


5.  We  conclude  Part  I by  mentioning  a few  more  open 
problems  related  to  pointwise  convergence  of  Lagrange  interpola- 
tion. 


(a)  Let  supp(w)  c:  [-l,l],0<c1<w(x)<c  <oo  for  x e [-1,1] . 
Is  it  true  that 

Pn(w,D  < cPm(w,l)  (n  < m)  ? 

If  m = n-fl  or  m = n+0(l)  the  answer  is  yes. 
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(b)  Let  supp(da)  c 


(log  a' ) 


2 ' T1 

G L 


lim 


Vl(da) 


Y (da) 

n->o°  n 


[-1,1].  Find  conditions,  weaker  than 
which  would  assure  that 

, ^ 1 
exists  and  = j. 


If  a' (x)  > 0 for  almost  every  x e (-1,1)  and  the  above  limit 
exists,  then  it  certainly  equals  1/2. 

(c)  Let  w be  a "good"  weight  in  some  analytic  sense  with  sup- 
port in  [-1,1].  Suppose  that  w(0)  = 0.  How  does  L (w,f,0) 
behave  if  f is  continuous? 


(d)  Try  to  interpolate  at  the  zeros  of  Pollaczek  polynomials. 
These  polynomials  are  the  best  known  ones  with  (log  w)//l-x2'  / 
L^  (see  [91]  ) . 

(e)  If  supp(da)  c;  [-1,1]  and  a1  (x)  > c > 0 for  almost  every 

x e [a,b]  c [-1,1]  then  |p  (da,x)  | = o ( /rT  ) for  x e (a,b). 

n x 

Will  this  estimate  hold  uniformly  in  x e [a^,b^]  c (a,b)?  If 
[a,b]  = [-1,1],  a is  absolutely  continuous  and  a'  is  con- 
tinuous, then  the  answer  is  yes. 

(f)  Let  supp(da)  a [-1,1]  and  <b  be  a continuous,  complex 
valued  function.  Under  what  conditions  does  the  limit 

2 

/ TT  (x)0(x)da(x) 

-1 


A 2 

min  / TT  (x)da(x) 

TTeff  (t)  -1 

n 


lim 


min 

TTelP  (t) 
n 


exist  and  if  it  exists,  what  is  its  value?  Here  IP  (t)  denotes 

n 

the  set  of  polynomials  TT  of  degree  at  most  n with  Tf(t)  = 1. 

(g)  What  happens  if  we  replace  IP  (t)  in  (f)  by  IP  , where 
* n n 

F denotes  the  class  of  those  polynomials  of  degree  exactly  n 
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whose  leading  coefficient  equals  1.  The  interesting  case  is 
when  log  a'  i L^. 

(h)  Let  supp(w)  = [-1,1].  Investigate  the  expression 
^T)  L„<"'fpn-1<")-1> 

where  f is  Riemann  integrable. 

II.  Mean  convergence 

In  this  part  we  will  be  concerned  with  mean  convergence 
of  Lagrange  interpolation.  Most  of  the  results  mentioned  here 
have  been  obtained  very  recently.  I hope  that  in  a few  years' 
time  the  theory  of  mean  convergence  of  Lagrange  interpolation 
will  reach  a point  where  all  the  results  below  will  be  consider- 
ed trivial.  A few  words  on  the  notation  used!  L*3  = ff. 

da  1 ’ 

/ | f ( t ) |pda(t)  < oo ] for  P > 0,  ||f||  = { / I f ( t ) |pda(t)rp 

-CO  00  * “ -00 

for  p > 1,  (f»g)doo  = / f (t)g(t)da(t),  an(t)  is  defined  as  a 

-00 

step  function,  having  jumps  A (da,x  (da))  at  x,  (da)  and 

n kn  kn 

being  constant  between  two  consecutive  zeros  of  p^(da).  Thus, 


<f’8)da  * .Z1£<xkn)s<’Ikn)\n(da)  an<1 

n k=l 


f , is  defined 

' “da  ,p 
n 


analogously.  The  function  w is  defined  as  w (t)  = 

n n 

t 

( / w(x)dx)  . || L (da)  ||  ,Q  denotes  the  (°°,p)  norm  of  the 

n n dp, oo,  p 

linear  operator  L^fda):  C — > L^  defined  by  L^(da) f = Ln(da,  f). 

We  shall  not  bother  to  compute  the  (p,p)  norm  of 
Ln(da)  for  the  simple  reason  that  it  is  never  bounded.  Let  me 
remark  that  Part  II  should  be  considered  as  a sequel  to 
R.  Askey's  paper  [3]  which  helped  me  understand  the  importance 
of  this  theory. 
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1.  In  this  section  let  v denote  the  Chebyshev  weight. 

Suppose  we  are  asked  to  prove  that  the  Lagrange  interpolation 

polynomials  taken  at  the  Chebyshev  abscissas  have  uniformly 

2 

bounded  (<»,2)  norms  as  mapping  from  C to  Lv»  The  solution 
is  very  simple! 


IVV’f)llv,2  = l|f|lv  ,2  — 


IVv  ,2  “ ^ C 1 v,2 ' 

n 


The  only  problem  is  that  this  is  an  essentially  L proof  and 

therefore  it  does  not  help  us  to  obtain  similar  inequalities 

2 

for  L^.  In  a sense  it  is  more  natural  to  compute  the  L norm 
v v 

of  Ln(v,f)  by  using  the  converse  of  Holder's  inequality,  that 


is 


VV’f)|lv,2 


sup 

ill  9<  1 

vf2— 


an(v,f),g)v. 


Denoting  by  Sn(v,g)  the  n-th  partial  sum  of  the  Fourier- 

Chebyshev  series  of  g,  let  us  remark  that  g - S . (v,g)  is 

2 n-1 

orthogonal  to  L^(v,f)  in  L^.  Thus,  using  the  Gauss-Jacobi 
mechanical  quadrature  formula,  we  obtain 

sup 


|Ln(v’f)liv,2 


(f,S  1 (v,g)) 

ii  ^ i n-I  v 

*|lv,2^1 


< 


< lifllr  sup  s . (v,g)  I . 

C ii  ii  „ . n-I  v , I 

The  next  step  would  be  to  show  that  for  every  polynomial  P of 
degree  less  than  n 


(2)  l|P|lv  x < C||P|| 

n*  ’ 


with  a constant  C independent  of  n and  P.  Having  estab- 
lished such  an  inequality  we  could  complete  our  proof  by  remark- 
ing 


|Sn-l(v’g)llv,l  * 


v,2  8 v,2 * 
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This  alternative  approach  has  the  advantage  of  being  based  on 
L techniques.  L inequalities  enter  only  in  the  last  step  to 
give  estimates  of  the  L1  norms  of  Fourier-Chebyshev  sums  (rather 
than  of  interpolating  polynomials).  If  (2)  holds,  then  this  ap- 
proach will  also  yield  i/  inequalities,  since  by  the  cele- 
brated theorem  of  M Riesz , the  Fourier-Chebyshev  sums  are  uni- 
formly bounded  in  every  for  1 < p < «>.  Let  us  now  return 

to  (2).  Following  the  methods  developed  in  [3],  [4],  [14] , and 

[99],  we  represent  P as 

P(x)  = (P, K(x , • ) ) v 


with  a suitably  chosen  kernel  K.  Hence 


'v  ,1<- 
n 


V,1 


sup  j|K (x , * ) ||  , 

|x|  < 1 Vn’ 1 


Some  rather  technical  arguments  show  that  K may  be  chosen 
such  that 

sup  ||K(x,  • )||  i < C. 

|x  | < 1 VL 


This  proof  is  fine  if  we  restrict  ourselves  to  considering  only 
the  Chebyshev  weight  or  some  Jacobi  weights  (see  [4])  but  if  we 
try  to  adapt  it  for  other  weights,  even  the  ones  which  are  very 
close  to  the  Chebyshev  weight,  we  encounter  greatly  increased 
technical  difficulties  which  defy  all  known  methods  and  tricks. 
Instead  we  must  use  a different  approach  for  the  proof  of  (2). 
(see  [62],  [63],  [68]).  By  a simple  computation 


I i < ci 

v , 1 — 1 

n 


lPl'v,l  + C2° 


-li 


P' 


for  every  polynomial  P.  In  particular,  if  the  degree  of  P 
is  less  than  n,  then  by  the  Bernstein  inequality 


lpllv  ,i  < <C  + C2)||p||v  i- 

n 
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We  remark  that  the  second  method  of  proving  (2)  can  also  be 
used  in  a more  general  situation. 

2.  Let  us  now  turn  to  the  formulation  of  the  problem  of 
mean  convergence  of  Lagrange  interpolation  processes.  We  fix 
two  weights  a and  p.  Let  f be  defined  on  U supp  (d(3) 


(Aa  = 


inf 

supp  (da)  — (a , b ) 


(a,b))  and  let  f be  finite  on  A 


a 


if 


f e LjQ  then  also  R (da,f)  = f - L (da,f)  e L 
dp  n n 


ask  whether  R (da,f)  tends  to  zero  in  Lj, 
n dp 


dP 

as  n 


and  we  can 


In  this  general  setting  the  question  seems  to  be  too  dif- 
ficult to  be  solved.  Let  us  remark  that  by  putting  P(x)  = 
a(x)  + S(x-Xq)  we  find  ourselves  confronting  the  problem  of 
pointwise  convergence  which  obviously  (?)  needs  considerations 
other  than  just  the  question  of  mean  convergence.  Apart  from 
the  case  where  f is  analytic  we  also  cannot  expect  conver- 


gence in  L 


,.Q  if  A A . 

dp  pa 


Let  A denote  the  class  of  those  weights  a for  which 


'da  ,1^C| 

n 


'da,  1 


whenever  P is  a polynomial  of  degree  less  than  n.  The  fol- 
lowing theorem  is  the  key  in  the  proof  of  convergence  of  La- 
grange interpolation  in  Ljn . 

dp 

THEOREM  1 . Let  a e A and  let  p be  absolutely  continuous 
with  respect  to  a.  Then  for  every  p such  that  1 < P < 00 


| L (da) 


L0  < c s . 

dp  ,oo,  p — n-1 


(da) 


dP, co,  p' 


So  far  we  have  not  been  able  to  find  any  weight  a e A with 
noncompact  support.  Nevertheless,  there  exists  a rather  large 
class  of  weights  which  certainly  belongs  to  A.  Let  us  say 
that  w ~ w^  if  supp(w)  = supp(w^)  and  0 < c^  < w(x)/w^(x) 

< C2  < oo  for  almost  every  x e supp(w). 
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THEOREM  2 . 


This 


THEOREM  3. 
nomial  P 


I £ w ~ w ^ ^ then 

is  a consequence  of  the 

If  w ~ w^’^  and 
of  degree  m < const  n 


< C 


w e A. 

following  more  general  result. 
1 < p < cd  then  for  every  poly- 


Many  particular  cases  of  Theorem  3 have  been  proved  in 
[3],  [62],  [63],  [64],  [68].  See  [82]  for  the  general  case. 

The  proof  of  Theorem  3 is  based  on  the  use  of  certain  BernsGein- 
Markov  inequalities,  estimates  of  the  zeros  of  the  corresponding 
orthonormal  polynomials  and  estimates  of  Christoffel  functions. 
There  are  dozens  of  fine  papers  treating  weighted  LP  inequali- 
ties for  the  derivatives  of  polynomials’,  see  e.g.  [15],  [16], 
[51],  [61],  [65],  [84],  [86],  [88],  [90],  and  [99].  The  most 
satisfactory  Bernstein-Markov  inequality,  however,  has  been 
proved  by  B.  Khalilova  [53]  whose  very  pretty  result  goes  as 
follows : 


THEOREM  4.  If  P e IP  then 

— n 


P* 


w 


(cc  + p/2  ,f}+p/2 ) 


< Cn  | P 


w 


(a, (3) 


for  a, (3  > -1  and  p > 1 . 


The  next  theorem  is  needed  to  deal  with  the  Christoffel 
functions  and  the  zeros  of  orthogonal  polynomials  in  the  proof 
of  Theorem  3.  (See  [21],  [24],  [32],  [76],  [82],  [91].) 

THEOREM  5.  Let  w ~ w(a,P\  Then 


An(w,x) 


2(3+1 


for 


x | < 1 and 


- 9k„<w) 


r^j 


l 

n 
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for  k = 0,1,..., n where  xkn(w)  = cos  ekri(w)  with  xQn(w)=l 
and  x , , (w)  = -1. 

Having  Theorems  1 and  2,  the  next  step  is  to  apply  results 
on  weighted  mean  convergence  of  orthogonal  Fourier  series.  Such 
questions  will  be  discussed  in  the  next  section. 

2 

3.  If  supp(da)  is  compact  then  for  every  f e Ldw 
n 

sn(da,f)  = Z (f,pk(da))dapk(da) 

k=0 


converges  to  f in  Lda  when  n -»  oo.  Much  less  can  be  said 


about  the  convergence  of  Sn(da,f)  in  L^. 


In  the  case  where 

a and  (3  are  Jacobi  weights  (not  necessarily  the  same)  or  are 

close  to  Jacobi  weights  in  an  analytic  sense,  the  problem  of 

convergence  of  S (da,f)  in  Lj  has  now  been  almost  complete- 
n dp 

ly  solved.  On  the  other  hand,  when,  for  instance,  a is  abso- 
lutely continuous  and  a'  is  the  Chebyshev  weight  multiplied 
by  a function  which  is  merely  continuous,  nothing  is  known  about 

the  convergence  of  S (da,f)  in  iZ  . For  known  results,  we 

n dQ! 

refer  the  reader  to  [2],  [6],  [7],  [8],  [9],  [10],  [48],  [50], 
[66],  [67],  [83],  [85],  [97],  and  [99]. 

For  our  purposes  we  need  theorems  of  this  type  which  can 
be  combined  with  Theorems  1 and  2 to  give  mean  convergence  of 
Lagrange  interpolation.  The  strongest  available  result  is  that 
of  V.  M.  Badkov  [11].  He  has  dealt  with  the  case  when  supp(w) 

= [-1,1], 


w(t)  = h(t)w 


(a,P) 


(t)  TT 

j=l 


|t-x. 


lYj 


h (t ) > 0,  h(t)  c C,  uo(h,  6)&-1  e L1 , 

0!,p, y,  > -1  (j  = 1,2,... ,m),  -1  < x.  < ...  < x < 1. 

j L m 


For  such  weights  w he  considered  the  mean  convergence  of 
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S (w,f)  in  1/  where 
n v 


v<t)  . w<A-B)(t)  fr  it-*  i 

i=l  J 


A>  B»  ^ > -1  (j  = 1,2,... ,m),  X.  are  the  same  as  above. 

THEOREM  6.  Le£  1 < p <C  °».  Then  ||s  fw)||  < C for  pvpfy 

n v,p,p  — 

n iff 


(3)  sup  <||p  (w)||  ><  °°,  sup  < ||p  (w)w||  . 

n >0  l V>PJ  n > 0 1 n v1/1_P, p/p-1 


< 00. 


THEOREM  7.  Conditions  (3)  are  equivalent  to  the  system  of  in- 
equalities . 


4.  From  this  point  it  becomes  easy  to  obtain  results  on 
mean  convergence  of  Lagrange  interpolation  from  the  results  men- 
tioned above.  Let  us  write  w«  w^*’^  if  supp(w)  = [-1,1], 


w ~ w(a,P),  w/w(a/P)  e C,  w/w(a/p)  > 0 and  the  modulus  of  con- 

.(a,P) 


tinuity  oo(S)  of  w/w' 


satisfies  the  condition 


/ 4^- d6  < °° • 

0 6 

THEOREM  8.  Let  w « w(a»^ 
integrable  for  some  e > 1. 
verges  to  f in  LP[-1,1] 
(i)  max (Q!,P)  < - j, 

(ii)  min(a,P)>-i, 


, (a.b)  e 

and  v = uwv  * 7 where  u is 

For  every  f e C [-1 , 1]  Ln(w,f)  con- 

when  n ->  oo 

a = 0,  b = 0 and  p > 0 
u is  bounded  in  some  neighborhoods 
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of  -1  and  1 and 


(iii)  a < - j < p,  u is  bounded  in  a neighborhood  of  -1, 
a = 0,  b > and 


0 < p < 


4(b+l) 
2 P+1 


(iv)  p < - J < a,  u 
b = 0 and 


is  bounded  in  a neighborhood  of 


1, 


0 < p < 


4(a+l) 
2a+l  ' 


Note  that  the  conditions  imposed  on  p are  also  necessary. 
This  is  clear  from  the  following  theorem. 

THEOREM  9.  Let  w =»  w^*’^  and  v = uw^a,b\  Suppose  that 
Ln(w,f)  tends  to  f in  for  every  f e C.  Lf  a > - j 

and  u ^ is  bounded  in  a neighborhood  of  1,  then  necessarily 


P < 


4 (a+1 ) 

2a+l 


and  similarly  if  p > - j and  u ^ is  bounded  in  a neighbor- 
hood  of  -1  then 


P < 


4 (b+1 ) 
2P+1  * 


The  method  which  I used  to  prove  Theorem  8 cannot  be  ap- 
plied to  prove  convergence  of  L (da,f)  to  f in  LjQ  when 

n ap 

f e in  the  Riemann-Stielt jes  sense.  However,  if  a = P, 

this  method  will  probably  work.  Theorems  8 and  9 have  been 
proved  in  [82].  They  contain  earlier  results  proved  in  [3], 
[4],  [19],  [22],  [23],  [52],  [63],  [92],  and  also  the  classical 

2 u 

L theorems  of  P.  Erdos-P.  Tura'n  [21]  and  J.  Shohat  [89]. 

It  should  be  possible  to  show  that  the  weight 
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in  v 

w(t)  = h(t)w(a,P)(t)  TT  It-x  I j 

j=l  J 

(Yj  > -1.  Xj  e (-1,1),  o < < h (t ) < c2  < oo) 

belongs  to  A and  thus  to  extend  Theorem  8 for  such  weights. 

A further  step  would  be  to  investigate  the  convergence  of 
Ln(da,f)  in  when  f e in  the  improper  Riemann- 

Stieltjes  sense.  G.  Freud's  book  [32]  is  a good  starting  point 
for  such  studies.  For  those  who  prefer  the  method  of  "nice 
kernel  functions"  to  that  of  Bernstein-Markov  inequalities  I 
recommend  the  recent  monograph  [5]  of  R.  Askey. 

As  mentioned  above,  all  the  weights  we  know  to  be  in  A 
have  compact  support.  For  this  reason  the  above  machinery  does 
not  work  when  supp(da)  is  not  compact.  Nevertheless,  after  a 
little  modification  it  becomes  applicable  for  certain  weights 
with  noncompact  support.  This  question  will  be  discussed  in 
the  following  section. 

5.  Recent  investigations  of  mean  convergence  of  Lagrange 
interpolation  processes  corresponding  to  weights  with  noncompact 
support  arose  from  the  physical  problem  of  obtaining  approximate 
expressions  for  the  Fourier  transforms  of  functions  which  are 
only  known  experimentally.  One  reasonable  approach  to  this 
question  is  to  replace  the  function,  which  is  supposedly  "known" 
only  at  certain  point,  by  an  interpolating  polynomial,  then  to 
take  the  Fourier  transform  of  this  polynomial  and  to  see  whether 
it  is  in  fact  a good  approximation  to  the  Fourier  transform  of 
the  function  itself.  As  was  shown  in  [13],  if  the  function  is 
given  at  the  Laguerre  nodes  (w(x)  = exp(-x),  x e 1R+)  this 
method  works  well.  To  be  more  precise, 

lim  / L (w,f,t)e  t+*xtdt  = / f(t)e  t+ixtdt 

n— >oo  0 n 0 
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uniformly  in  x e ]R  provided  that  f is  continuous  and 
lim  f(t)  exp(-et)  = 0 for  some  e < j.  This  result  gave 

t— >+°°  2 

rise  to  [12]  and  [26]  where  the  problem  of  convergence  in  L 

was  thoroughly  investigated. 

J.  Balazs  and  P.  Tura#n  [12]  dealt  with  the  case  where  OC 

is  absolutely  continuous  and  w = a'  can  be  represented  in  the 

form  w = h/g  where  h e L^(]R)  and  g is  an  even,  infinitely 

many  times  differentiable  function  which  satisfies  the  condi- 
(k) 

tions:  g (x)  >0  (k  = 0,2,...),  log  g is  convex  function 

of  log  x and 

/"•12L a|£i  . ^ 

0 1+t 

THEOREM  10.  U f e C(]R)  and  lim  f(x)/  /g(x)'  = 0 then 

X — >oo 

2 

Ln(dCt,f)  converges  to  f iji  1^(10  when  n ». 

The  following  special  case  of  Theorem  10  is  of  particular 
interest. 

THEOREM  11.  Let  f e C(]R)  and  f(x)  = 0[exp(ex^)]  when 
x -»  oo  with  some  e < j.  Then  the  corresponding  Lagrange  in- 
terpolation polynomials  taken  at  the  Hermite  abscissas  converge 
to  f in  L^(]R)  with  w(x)  = exp(-x^). 

In  order  to  formulate  G.  Freud's  result  in  [26]  we  have  to 
introduce  a definition.  We  say  that  the  weight  a belongs  to 
U if  the  equalities 

00  00 

/ xnda(x)  = / xndP(x)  (n  = 0,1,...) 

-00  -00 

imply  that  Q£(x)  - a(-oo)  = p(x)  - P(-«>)  at  every  point  x 
where  a or  P is  continuous. 

THEOREM  12.  Let  0!  e U and  let  G be  a function  which  is 
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infinitely  many  times  differentiable  and  satisfies  the  condi- 
tions G^k\x)  > 0 (k  = 0,2 , ...  ) and  G e L^.  Then  for 

every  function  f with  lim  f(x)/  \/G(x)  =0,  L (da,f)  tends 

x— >°° 

2 2 
to  f in  LdQ,(]R)  provided  that  f e • 

Here,  and  in  the  sequel  Lda(]R)  should  be  understood  in 
the  improper  Riemann-Stielt jes  sense;  that  is,  £ * OR)  if 
a 

(R-S)  / | f ( t ) |Pda(t) 

-a 

exists  for  every  a > 0 and 
a 

lim  (R-S)  / |f(t)  |Pda(t)  < oo. 

a-»oo  -a 

We  shall  not  go  into  the  details  of  the  proofs  of  the  re- 
sults in  this  section;  we  only  mention  that  they  are  essentially 
2 

L proofs  and  cannot  be  generalized  to  handle  the  case  of  con- 
vergence in  LP.  (See  also  [93].)  Both  P.  Tura'n  [92]  and  G. 
Freud  [32]  have  raised  the  question  of  what  happens  in  LP. 

Of  course,  there  is  virtually  no  hope  of  obtaining  LP  con- 
vergence theorems  for  general  weights  while  the  classical  cases, 
namely  Hermite  and  Laguerre  weights,  remain  uninvestigated. 

R.  Askey  [3],  recalling  an  old  idea  of  J.  Marcinkiewicz  [63], 
realized  that  the  question  of  mean  convergence  of  Lagrange  in- 
terpolation can  be  reduced  to  investigation  of  mean  convergence 
of  orthogonal  Fourier  series  (see  Theorem  1).  It  turns  out  to 
be  unfortunate  that  R.  Askey  knows  a very  great  deal  about  mean 
convergence  of  orthogonal  Fourier  series  and  he  knew  well  that 
there  exist  no  good  convergence  theorems  in  LP  for  Fourier- 
Hermite  and  Fourier-Laguerre  series,  respectively.  For  this 
reason  he  wrote  in  [3]  that  "the  lack  of  nice  theorems  (for  LP 
convergence)  suggests  that  there  are  only  fairly  weak  results 
to  be  obtained  for  Lagrange  interpolation  at  the  zeros  of  the 
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Laguerre  or  Hermite  polynomials.  Turan  raised  this  question... 
and  I too  would  like  to  see  some  results  on  this  question.  How- 
ever, I am  afraid  they  will  be  weaker  than  one  might  have  ex- 
pected!" 4 

Askey  was  right  only  partially.  There  are  both  weak  and 
strong  theorems  for  mean  convergence  of  Lagrange  interpolation 
taken  at  the  Hermite  or  Laguerre  abscissas. 

6.  Throughout  this  section  w will  denote  the  Hermite 

2 p /2 

weight;  that  is,  w(x)  = exp(-x  ),  and  let  w(p)  = • The 

weak  result  referred  to  above  is  the  following: 

THEOREM  13.  Let  p > q > 1.  Then  there  exists  a continuous 

function  f supported  in  [-1,1]  such  that  \ 

lim  sup  IlL  (w,  f ) ||  _ = 

n ->  oo  n (q),P 

in  particular,  for  every  p > 2 one  can  find  a continuous  f 
with  support  in  [-1,1]  such  that 

lim  sup  IlL  (w,f)|l  = oo. 

1 n "w,p 

The  proof  of  this  theorem  is  surprisingly  easy.  Since  1 

we  can  consider  Ln(w)  38  linear  operators  from  C[-l/2,l/2] 

into  LP  (]R)  , by  the  Banach-Steinhaus  theorem  it  is  enough  ‘ 

W(q) 

to  construct  a sequence  { f ^3  with  fR  e C[-l/2,l/2],  || f n llc < 1 
and 

lim  sup  IlL  (w,f  ) II  =oo. 

n n w.  .,p 
n ->  oo  (q) 

The  existence  of  the  above  sequence  [f^]  follows  immediately 
from  the  next  two  theorems. 

THEOREM  14.  For  every  n there  exists  a function  e 
C [-1/2 ,1/2]  with  l|fnllc  = 1 and  a number  x^  e ( /n  - 3,  /n  + 3) 
such  that 
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”(l)(!'„)|Ln(”l£n','n)|  ^C- 
THEOREM  15.  Let  P e IP  . Then  for  every  p > 1 

l|P"(2/p)llc  ^ Cn1/2Pl|P|l„>p. 

Theorem  14  has  been  proved  in  [29].  Theorem  15  is  obvious 

when  p is  an  even  natural  integer  and  for  the  other  values  of 

p it  can  be  proved  by  an  interpolation  (not  Lagrange  but  Riesz- 

Thorin)  argument.  Theorem  13  suggests  that  if  we  want  to  have 

convergence  in  LP  we  should  consider  convergence  in  LP  (]R) 

rather  than  in  LP(]R). 

w 

THEOREM  16.  Let  f e C(1R)  and  satisfy  the  condition 

(4)  lim  f(x)xw.j.(x)  = 0. 

Then  for  every  p > 1, 

(5)  lim  ||L  (w,  f ) - f ||  =0. 

IHoo  n (p)’P 

It  is  quite  possible  that  a condition  weaker  than  (4)  could 
also  imply  (5).  On  the  other  hand,  it  should  be  possible  to 
show  that  the  conditions 


f e L^  (]R)  0 C(R)  , lim  f(x)w  .(x)  = 0 

(p)  X— »oo  ^ ' 


are  too  weak  to  imply  (5).  The  proof  of  Theorem  16  is  rather 
complicated;  it  is  based  on  the  following  two  theorems. 

THEOREM  17.  Let  P £ ]Pn  and  p > 1.  Then 

11*11*  „ s ciK">p|l»  «• 

(P),P  ^ ' (P)’P 

where  l*n)  is  the  characteristic  function  of  [-c^/n,  c^/n  ] 
with  an  appropriate  constant  c^  and 
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!|p'IIw  < c xAT  ||p| 

(P),p  (p)’1 


THEOREM  18.  Let  P e IP  with  m < const  n.  Then  for  every 
p > 1,  ae  H , 0 < b < p 


-1 


(6)  PG  J < C PG  _ . , 

" a,b"wn>p  — a+2/p,b'w,p 

and 


(7)  PG  .1,  J < C |PG  , 

11  a,b  (n)"wn,p  - 11  a,b  w, p 


3 2 

where  Gfl  ^(x)  = (l+|x|)  exp(bx  ) and  1 ^ is  the  character- 

istic function  of  [-  i/rT,  /t?] . 


The  very  important  Theorem  17  has  been  proved  by  G.  Freud 
[35].  It  has  nice  applications  in  the  theory  of  weighted  L*3 
apr  ximation  which  has  been  developed  by  G.  Freud  in  [33]-[41]. 
Let  us  remark  that  if  we  put  p=2,  a=b=0  in  (6)  we  get 
an  inequality  which  is  not  sharp  for  polynomials  of  degree  less 
than  n,  since  on  the  right  side  we  have  an  extra  factor 
( 1+ }x | ) . Hence  it  is  possible  that  this  factor  can  also  be 
omitted  in  the  general  case,  that  is,  w e A.  Instead  of  prov- 
ing Theorem  16  let  us  discuss  an  essential  step  in  its  proof 
which  we  formulate  as 


THEOREM  19.  Let 

M = sup  (1+  |x|)w^(x)|f(x)|  < oo. 
X€  K 

Then  for  every  p > 1 


Proof.  We  have 


I = ||L  (w,  1 .f)l.  J 

n (n)  (n)  w(p)»P 


sup  (L  (w,I ( ), f ),g  ) 

l8llw,q^1 


where  q = p/(p-l)  and  g*  = 1(n)gw(p_2/p)*  Hence 


i 
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I = sup  (L  (W,l  f),  S (w,g  )) 

llg|lv>q<i  ■>(■»■> 

= sup  (L  (w,l  .£),  S (w,g*)) 

il*llw>q<  i ” (n) 

- M sup  l|l(n)G_i  1/2sn(».g*)llw 
1 n 


and  by  (7) 


I < CM  sup  ||G  , i /PS  (w,g*)|| 

lldl  <i  1>1/2  n W>1 


"w,q  - 


From  this 


I < CM  sup  sup  (G  1 i /,S  (w,g*),h) 

IlglL  < l llhll  < l ” 


"""c  — 

sup  sup  (g*,S  (w,G  . i /oh))  . 

||  <1  h <1  n *1’1'2  W 

"w,q  — " "c — 


By  Holder's  inequality 

I < CM  sup  lllgnxSn(W,G  i/9h)|| 

||h||  <1  (n)  n _1’1/2  W(P)»P 


Here  the  right-hand  side  can  easily  be  estimated  remembering 
that  for  every  <t)  e L^(1R) 


Y (w) 

|Sn(w,0)|=  - — |pn(w)H(^Pn+1(w)w)  - Pn+1(w)H(0Pn(w)w)  | 


where  H denotes  the  Hilbert  transform.  To  finish  the  proof 
we  use  some  well-known  properties  of  Hermite  polynomials  and 
obtain  I < CM. 

We  refer  the  reader  interested  in  mean  convergence  of 
Fourier-Hermite  series  to  [7],  [67],  [83],  and  [85]. 

7.  Similar  theorems  can  be  proved  for  the  generalized 
Hermite  or  Laguerre  weights.  These  results  will  soon  be 
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published.  However,  attempts  to  extend  these  results  to  weights 
other  than  the  classical  ones  come  up  against  serious  difficul- 
ties. These  difficulties  arise  because  not  much  is  known  about 
the  behavior  of  orthogonal  polynomials  corresponding  to  weights 
with  noncompact  support.  For  certain  weights,  like 
| x | exp  (-|x|P)  and  exp(g(x))  (where  g is  a "good"  func- 
tion), some  properties  of  the  corresponding  orthogonal  polyno- 
mials are  already  known.  These  properties  include  exact  esti- 
mates of  the  Christoffel  functions,  of  distances  between  conse- 
cutive zeros  and  of  leading  coefficients  (see  [18],  [20],  [42]- 
[47],  [75],  and  [76]).  A next  step  would  be  to  find  asymptotic 
formulae  or  at  least  useful  bounds  for  the  orthogonal  polyno- 
mials. In  particular,  it  would  be  interesting  to  have  a proof 
of  the  following  conjecture. 

CONJECTURE.  Let  w(x)  = exp(-jx|a)  (Ct  > 0,  x e ®)  . Then 
2 

Pn(w,x)w(x)  < C (x  e ]R) 

and  there  exists  a constant  c^  such  that 

p^(w,x)w(x)  < C^1^  ( |x  | < Cjn1^), 

or.  more  generally,  if  w(x)  = exp(- |x  |a)h(x)  where  h is  a 
"very  good"  strictly  positive  and  bounded  function,  then  the 
above  inequalities  hold. 

Even  for  just  the  special  case  a = 2,  a proof  of  the 
second  form  of  the  conjecture  would  be  of  considerable  interest. 

8.  We  conclude  this  survey  with  a few  open  problems. 

(a)  Find  conditions  assuring  the  existence  of  a p > 2 such  that 
Pn(w)  is  uniformly  bounded  in  L^. 

(k)  (k) 

(b)  Investigate  mean  convergence  of  (da,f)  to  fv  , pro- 

vided that  f^)  £ c> 

(c)  Determine  the  asymptotic  value  of  sup  ||L  (da,f)-f|| 

feM  n dS.p 
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for  n -»  oo  if  M is  a compact  subset  of  C. 


(d)  Estimate  x.  , (w)  - x,  (w)  when  w is  between  two  dif- 

v k+1 , n kn 

ferent  Jacobi  weights. 


(e)  Investigate  the  mean  convergence  of  L*(da,f)  which  coin- 
cides with  f at  {x.  (da)}  U {+l}. 

kn  — 

(f)  Does  A (see  Theorem  1)  contain  any  weight  supported  on 
two  disjoint  intervals? 

(g)  Let  supp(w)  = [-1,1].  If  (log  w)//l  -x^  e then  for 
every  linear  functional  F on  L^ 


2 

lim  F(pn(w)w)  = F(v) 

n-*oo 

where  v is  the  Chebyshev  weight.  Is  the  condition  log  weL 
necessary? 


(h)  Give  exact  estimates  for 
when  it  is  not  bounded. 


IV” 


(a,P)| 


w 


(a,b  ) 


(i)  If  supp(da)  cr  [-1,1]  and  (log  a')/ 
An(da,x)p^(da,x)  = ctx C 1 ) for  x e (-1,1). 
extimate  hold  uniformly  on  [a,b]  c;  (-1,1) 


n 2 t 1 
1-x  e L 

When  will 


then 

this 


Finally,  I would  like  to  thank  Richard  Askey  and  Michael 
Civikel  for  having  kindly  consented  to  read  a draft  version  of 
this  survey  and  to  offer  several  helpful  suggestions  for  im- 
proving the  presentation. 
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FITTING  SURFACES  TO  SCATTERED  DATA 
Larry  L.  Schumaker 

This  paper  is  a survey  of  a variety  of  numerical  methods 
for  fitting  a function  to  data  given  at  a set  of  points  scat- 
tered throughout  a domain  in  the  plane.  We  discuss  four 
classes  of  methods:  (1)  global  interpolation,  (2)  local  inter- 

polation, (3)  global  approximation,  and  (4)  local  approximation. 
We  also  discuss  two-stage  methods  and  contouring.  The  surfaces 
constructed  will  include  polynomials,  spline  functions,  and  ra- 
tional functions,  among  others. 

1 . Introduction 

Our  aim  is  to  survey  methods  for  solving  the  following 
problem. 

PROBLEM  1.1.  Let  D be  a domain  in  the  (x,y) -plane,  and  sup- 
pose F is  a real-valued  function  defined  on  D.  Suppose  we 
are  given  the  values  F^  = F(x^,  y^)  of  F at  some  set  of 
points  (x^,y^)  located  in  D,  i = 1,2, ...,N.  Find  a function 
f defined  on  D which  reasonably  approximates  F. 

This  problem  is,  of  course,  precisely  the  problem  of  fit- 
ting a surface  to  given  data.  In  many  cases  the  domain  D is 
a rectangle  and  the  data  points  lie  on  a rectangular  grid. 

There  are,  however,  many  practical  problems  (see  the  following 
section  for  some  specific  examples),  where  D is  of  unusual 
shape  and  where  the  data  points  are  irregularly  scattered 
throughout  D.  Thus,  while  we  shall  pay  some  attention  to  spe- 
cial methods  for  regularly  spaced  data,  we  are  actually  more 
interested  in  the  general  case. 

There  are  basically  two  approaches  to  handling  Problem  1.1. 
First,  we  may  try  to  construct  a function  f which  interpolates 
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the  data  exactly;  i.e.,  such  that 

(l.D  f(xi,yi)  = Ft,  i = i,2,...,n. 

This  approach  may  be  desirable  when  the  function  values  at  the 
data  points  are  known  to  high  precision  and  where  it  is  highly 
desirable  that  these  values  be  preserved  by  the  approximating 
function. 

The  second  approach  involves  constructing  f which  only 
approximately  fits  the  data.  This  may  be  regarded  as  data 
smoothing  and  will  be  desirable  when  (as  is  often  the  case) 
the  data  are  subject  to  inaccurate  measurement  or  even  errors. 
The  question  of  whether  interpolation  or  approximation  should 
be  used  will  not  be  discussed  further  here--this  is  a problem 
which  must  be  settled  for  the  individual  problem  at  hand. 

In  discussing  Problem  1.1,  it  will  be  convenient  to  make 
a further  distinction  between  those  methods  which  are  local  in 
character  (i.e.,  where  the  value  of  the  constructed  surface  f 
at  the  point  (x, y)  depends  only  on  the  data  at  relatively 
nearby  points)  and  those  methods  which  are  global  in  nature. 
Thus,  we  discuss  four  categories  of  methods  in  sections  3-6: 

(1)  global  interpolation,  (2)  local  interpolation,  (3)  global 
approximation,  and  (4)  local  approximation.  In  each  of  these 
sections  we  further  subdivide  the  material  according  to  the 
type  of  functions  being  used  and  the  type  of  data  (scattered  or 
not)  for  which  the  method  is  suitable. 

In  discussing  methods  which  apply  only  to  special  arrange- 
ments of  data  points,  we  have  two  objectives  in  mind.  First, 
the  methods  are  of  interest  in  their  own  right.  More  important- 
ly in  terms  of  Problem  1.1,  however,  such  methods  can  also  be 
used  in  two-stage  processes  in  which  we  first  construct  a sur- 
face g based  on  the  scattered  data,  and  then  use  g to  gen- 
erate regular  data  for  the  construction  of  another  (perhaps 
smoother  or  more  convenient)  surface  f.  Such  two-stage  methods 
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will  be  discussed  (along  with  several  examples)  in  more  detail 
in  section  7. 

For  many  of  the  methods  based  on  regular  data  and  some  of 
those  for  scattered  data,  error  bounds  are  available  to  indi- 
cate how  well  smooth  functions  are  approximated  by  the  surface 
constructed.  We  do  not  have  space  to  go  into  the  extensive 
literature  on  error  bounds.  A simple  test  of  how  well  a method 
will  approximate  smooth  functions  is,  however,  provided  by  its 
ability  to  reproduce  polynomial  surfaces  exactly  (that  is,  if 
F is  a polynomial  in  x and  y up  to  a certain  degree,  then 
the  surface  f is  identically  equal  to  F) . For  many  of  the 
methods  we  will  be  able  to  indicate  the  corresponding  degree  of 
exactness . 

In  many  of  the  applications  of  surface-fitting  techniques 
(cf.  the  examples  in  section  2),  the  ultimate  aim  is  to  use  the 
data  to  construct  a contour  map  of  the  unknown  function.  Since 
F is  known  only  at  the  data  points,  we  must  be  content  to  con- 
struct a contour  map  for  one  of  our  fitted  surfaces.  In  sec- 
tion 8 we  discuss  some  approaches  to  accomplishing  this  numeri- 
cally. 

We  close  this  introduction  with  a disclaimer--this  survey 
does  not  include  all  possible  methods  for  fitting  surfaces  to 
scattered  data.  For  example,  we  have  not  discussed  Fourier 
series  methods,  spatial  filtering,  and  other  such  related  sta- 
tistical techniques.  In  addition,  the  set  of  references  for 
those  methods  which  we  have  discussed  are  also  not  complete. 

My  original  intention  was  to  compile  as  complete  a bibliography 
as  possible,  but  the  sheer  bulk  of  relevant  papers  and  my  in- 
ability to  locate  all  of  them  convinced  me  to  settle  for  less. 

I have  opted  to  quote  a fairly  representative  list  of  papers. 
Including  several  other  surveys.  Further  references  can  be 
found  by  consulting  these.  I shall  be  very  happy  to  receive 
information  on  references  and  methods  I have  overlooked. 


205 


LARRY  L.  SCHUMAKER 


2.  Examples 

In  this  section  we  shall  quote  several  explicit  examples 
of  Problem  1.1  to  emphasize  the  fact  that  unusually  shaped  re- 
gions and  scattered  data  do  arise  frequently  in  practice. 

EXAMPLE  2.1.  Petroleum  exploration.  In  exploring  for  petro- 
leum, the  contours  of  various  underground  layers  of  sandstone, 
shale,  limestone,  etc.  can  be  important  indicators  of  possible 
oil  fields.  Frequently,  data  on  such  layers  is  available  from 
exploratory  wells,  which,  however,  have  most  likely  been  drilled 
at  locations  scattered  randomly  throughout  some  geographical  re- 
gion of  interest.  To  quote  a specific  example,  Robinson, 
Charlesworth,  and  Ellis  [166]  consider  precisely  this  problem 
for  some  data  obtained  from  7,500  wells  drilled  in  Alberta.  For 
another  example  of  this  type,  see  Whitten  and  Koelling  [208]. 

Problems  similar  to  that  mentioned  in  Example  2.1  arise 
frequently  in  cartography  and  submarine  topography  where  the 
measurements  represent  actual  elevations.  In  some  cases  the 
measurements  must  be  taken  from  photographs  or  from  sonar  mea- 
surements and  are  usually  subject  to  some  measurement  error 
(eg.  see  Kubik  [125]  for  a discussion  of  photogramme try) . 

EXAMPLE  2.2.  Geological  maps.  There  are  a great  many  problems 
in  Geology  and  the  earth  sciences  in  which  the  data  arises  from 
some  other  function  of  location  besides  actual  elevations.  For 
example,  some  geological  variables  of  interest  might  include 
concentrations  of  various  chemicals,  specific  gravity,  electri- 
cal resistivity,  grain  size,  texture,  optical  properties,  iso- 
tope ratios,  etc.  To  quote  a specific  example,  Bhattacharyya 
[21,  22]  discusses  methods  for  fitting  a surface  to  measurements 
(taken  by  airborne  sensors)  of  magnetic  potentials  over  a cer- 
tain portion  of  the  Yukon.  See  also  Bhattacharyya  and  Raychaud- 
huri  [23]  and  Crain  and  Bhattacharyya  [61]. 


206 


FITTING  SURFACES  TO  SCATTERED  DATA 


The  importance  of  surface- fitting  methods  in  the  earth  sci- 
ences can  be  judged  by  the  large  number  of  papers  in  the  area 
relating  to  various  fitting  methods.  For  a further  list  of 
problems  and  a discussion  of  some  of  the  methods  which  have  been 
applied,  see  the  books  of  Bohrenberg  and  Giese  [31],  Chorley 
[51],  David  [62],  Harbaugh  and  Merriam  [98],  and  Merriam  [140]. 
Recent  survey  papers  include  Whitten  [203,  205]  and  Whitten  and 
Koelling  [207].  To  add  just  a few  more  of  the  papers  in  the 
geological  literature  dealing  with  surface  fitting  to  our  list, 
we  mention  Anderson  [7],  Grant  [91],  Hessing,  Lee,  and  Pierce 
[114],  Holroyd  and  Bhattacharyya  [115],  Kubik  [123,  125],  Nor- 
cliffe  [151],  Reilly  [162],  Whitten  [200,  201,  204],  and  Whit- 
ten and  Koelling  [206]. 

EXAMPLE  2.3.  Heart  potentials.  In  order  to  diagnose  certain 
abnormal  heart  conditions,  it  is  desired  to  make  a series  of 
several  hundred  contour  maps  of  the  heart  potential  field  at 
time  steps  of  1/100  of  a second  throughout  a heart  beat.  Data 
on  these  heart  potentials  can  be  obtained  by  fitting  the  patient 
with  a shirt  containing  probes.  Because  of  body  geometry,  when 
this  shirt  is  flattened  out  it  takes  the  nonrectangular  form 
illustrated  in  Figure  1.  Although  the  probes  could  be  arranged 
fairly  regularly  in  this  domain,  because  of  the  added  signifi- 


Figure  1.  Heart  Potential  Measurements 
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cance  of  frontal  measurements,  in  practice  more  probes  are 
fitted  there  than  in  the  back.  This  example  was  brought  to  my 

* 

attention  by  Ms.  Patrizia  Ciarlini  of  Rome. 

Potential  fields  arise  in  many  other  applications.  We  * 

have  already  mentioned  Geology  in  Example  2.2.  For  some  exam-  ■ 

pies  in  modelling  plasmas  see  Buneman  [40].  The  problem  arises  i 

in  Biersack  and  Fink  [24]  in  experimentally  studying  crystal 
structure  using  neutron  bombardment.  Data  from  waveform  dis- 
tortion in  electronic  circuits  can  be  found  in  Akima  [5,  6] . J 

I 

3 . Global  interpolation  methods 
In  this  section  we  outline  several  methods  for  solving  the 

\ 

interpolation  problem  (1.1). 

3.1  Polynomial  interpolation.  (Scattered  data) . The  general 

theory  of  finite  dimensional  interpolation  is,  of  course,  very  I 

N 

well  known  (e.g.,  see  Davis  [63]).  Briefly,  if  [0^)^  are  N 
functions  defined  on  the  domain  D,  then  the  function 


(3.1)  f (x,  y)  = Ea.0.(x,  y) 


J=1 


J J 


will  satisfy  (1.1)  if  and  only  if  {a.}^  is  a solution  of  the 
linear  system 
n 

(3.2)  Za0(x,y)=F,  i=l,2,...,N. 

j=l  J J 

This  system  has  a (unique)  solution  for  arbitrary  choices  of 
data  precisely  when  it  is  nonsingular.  This  depends  on  the 
choice  of  functions  (0.)^  and  the  location  of  the  data  points. 

J 1 jq 

To  illustrate  this  method,  we  may  choose  the  [0^]^  to  be 

polynomials  in  x and  y.  Given  N,  there  is  some  leeway  in 

the  choice  of  which  powers  of  x and  y to  use.  For  example, 

with  N = 3 one  could  use  the  functions  1,  x,  y or  possibly 

2 2 

the  functions  1,  x , y , etc.  When  N is  of  the  form  N = 
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(d+l)(d+l),  we  might  use  the  functions 


As  simple  as  this  sounds,  there  are  some  serious  difficul- 
ties with  polynomial  interpolation  of  scattered  data.  For  open- 
ers, it  is  not  so  easy  to  guarantee  that  the  system  (3.2)  is 
nonsingular.  To  give  a very  simple  example,  consider  the  case 
N = 3 with  the  functions  1,  x,  y.  If  the  three  data  points 
happen  to  lie  on  a line,  then  (3.2)  will  in  fact  be  singular. 
Even  when  (3.2)  is  nonsingular,  it  will  often  be  the  case  (at 
least  if  N is  moderately  large)  that  the  system  will  be  ill- 
conditioned.  Finally,  as  is  well  known,  polynomials  of  even 
moderate  degree  exhibit  a considerable  oscillatory  character, 
and  the  resulting  surface  (even  though  it  is  C°°)  is  often  too 
undulating  to  be  acceptable.  The  general  problem  of  polynomial 
interpolation  to  scattered  data  is  not  usually  treated  in  Nu- 
merical Analysis  and  Approximation  Theory  books  (see,  however, 
Kunz  [126],  Prenter  [157],  and  Steffenson  [186]).  Some  papers 
dealing  with  the  question  include  Guenther  [93],  Thatcher  [189], 
Thatcher  and  Milne  [190],  and  Whaples  [197].  Assuming  the  in- 
terpolant  exists,  error  bounds  have  been  studied  in  Ciarlet  and 
Raviart  [52-55]. 

Let 


(3.3) 


m,  n 


r V u-,m 
span  [x  yK) 


V=0 , p=0 


be  the  space  of  polynomials  of  degree  m in  x and  of  degree 
n in  y.  This  linear  space  is  of  dimension  (m+1) (n+1)  and 
is,  in  fact,  the  tensor  product  of  the  linear  spaces  SP  and 


m 


n 


It  is  perhaps  of  interest  to  note  that  there  always  exists 


a (usually  nonunique)  polynomial  P e N solves  the 

interpolation  problem  (1.1),  no  matter  how  the  data  points  are 
positioned,  see  Prenter  [158]. 
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3.2  Polynomial  interpolation  (gridded  data) . We  begin  this 
subsection  by  defining  what  we  mean  by  gridded  data.  Let 

(3.4)  H = [a, b]  x [c,d] 

be  a rectangle,  and  let 


a ~ 

< X, 

< . . . 

. . . < 

x,  , = b 

0 

1 

k+1 

O 

II 

<*1 

< ... 

...  < 

yi+i  “ d- 

We  suppose  now  that  F is  a function  defined  on 
we  have  the  values  of  F at  the  corner  points  of 
lar  grid  defined  by  (3.5);  i.e., 


(3.6)  F.  . = F(x.,y.), 

l j i7  j ’ 


i = 0,1,.. . ,k+l 
j =0,1,...,  £+1 . 


H,  and  that 
the  rectangu- 


This  is  a total  of  N = (k+2) (£+2)  data  points. 

It  is  quite  easy  to  show  that  there  exists  a unique  poly- 
nomial p in  the  class  P , A (cf.  the  definition  (3.3)) 

R+  1 y i/">  JL 

which  interpolates  the  gridded  data  given  in  (3.4) -(3.6).  In 
fact,  p can  be  written  down  explicitly  in  terms  of  the  one- 
dimensional Lagrange  polynomials  as 


k+i  £+1 

(3.7)  p (x,  y)  = Z Z F L (x)L  (y), 
i=0  j=0  1 J 

where  the  (L^x)}^1  and  {L . (y)  3q+1  are  the  usual  one- 
dimensional Lagrange  polynomials  associated  with  the  interpola- 
tion points  an<*  {y.j]o+1>  respectively.  Interpolation 

of  gridded  data  by  polynomials  has  been  discussed  in  various 
books  and  papers--we  do  not  bother  with  a long  list.  See  e.g. 
Prenter  [157]  or  Steffenson  [186].  More  recently,  there  has 
been  considerable  work  on  Hermite  and  osculatory  interpolation 
in  several  variables;  see  e.g.  Ahlin  [3],  Haussman  [99,101,102], 
and  Salzer  [168-170]. 
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3.3  Shepard's  method.  In  this  subsection  we  discuss  a method 
of  Shepard  [180 j and  some  modifications  of  it.  The  method  ap- 
plies to  arbitrarily  spaced  data,  and  the  interpolating  function 
can  be  written  down  explicitly. 

Let  p be  some  metric  in  the  plane,  for  example  the  usual 
distance  metric.  Given  a point  (x, y) , let  r^ = p( (x, y) , (x^, y^)) 
for  i = 1,2, ...,N.  Let  0 < p < «.  Then  Shepard's  interpola- 
tion formula  is  defined  by 


N F . 


(3.8)  f(x,y)  = J 


i=l  V' 


N 


Z -M/I  Z M 


U=i  V 


when  r^  4 0,  all  i 


when  r^  = 0, 


The  formula  (3.8)  is  defined  for  all  points  (x, y)  in  the 
2 

plane  R . It  is  clear  from  the  definition  that  it  interpolates 
the  values  at  the  data  points  (x^,y^),  i = 1,2, ...,N. 

The  value  of  f(x,y)  at  nondata  points  is  obtained  as  a weight- 
ed  average  of  all  the  data  values,  where  the  icn  measurement 
is  weighted  according  to  the  distance  of  (x,y)  from  the  point 
(xi,yi)  . 

We  shall  briefly  recount  some  of  the  properties  of  Shep- 
ard's formula.  First,  by  converting  all  of  the  terms  to  a 
common  denominator,  it  can  be  shown  that 
N 

(3.9)  f(x,y)  = Z F A (x,y), 
i=l  1 1 

where 

TT  tr  (x,y>r 

j=l  J 

i = 1,2,  ...,N  . 


(3.10)  A.(x,y) 


N N * 

Z Z [r  (x,  y)  ] 4 

k=l  Z=l  t 


These  functions  satisfy 
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(3.11)  (x^, y^)  - 1>  J - 1,2, 

The  representation  (3.9)  is  numerically  more  stable  than  the 
original  formula  (3.8). 

In  view  of  its  definition,  we  see  that  the  function  f(x, y) 
constructed  by  Shepard  is  not  a simple  polynomial  or  rational 
function.  It  is  clear,  however,  that  except  for  the  points 
(xi,yi),  it  is  analytic  everywhere  in  the  plane.  Its  behavior 
in  the  vicinity  of  the  data  points  (xi>y^)  depends  on  the  size 
of  p.  It  can  be  shown  that  for  0 < p < 1,  f has  cusps  at 
these  points.  For  1 < p,  f has  flat  spots  at  the  data  points 
(i.e.,  the  partial  derivatives  vanish  there).  We  also  observe 
the  interesting  property  that 

(3.12)  min  F < f(x, y)  < max  F.. 

lSiSN  1 l=>i§N  1 

We  may  also  note  that  if  the  data  came  from  a constant  function, 
i.e.,  F^  = c,  i = 1,2, ...,N,  then  f is  also  the  constant 
function  f ■ c. 

We  now  comment  on  the  choice  of  p.  To  get  smooth  surfaces 
without  cusps,  it  is  desirable  to  take  1 < p.  On  the  other 
hand,  if  p is  relatively  large,  then  the  surface  tends  to  be- 
come very  flat  near  the  data  points  and  consequently  quite  steep 
at  points  in  between.  Experiments  (cf.  Gordon  and  Wixom  [90], 
Poeppelmeir  [155],  and  Shepard  [180])  seem  to  indicate  that  a 
choice  of  p = 2 is  perhaps  a good  tradeoff.  ([155]  contains 
several  examples  showing  the  behavior  as  a function  of  p.) 

There  are  several  drawbacks  to  Shepard's  method  (3.8),  as 
pointed  out  by  Shepard  [180]  himself.  First,  if  N is  large, 
then  there  is  a very  considerable  amount  of  calculation  in- 
volved in  evaluating  f(x, y)  at  a particular  point.  Secondly, 
the  weights  are  assigned  on  the  basis  of  the  distance  of  points 
from  (x, y)  only,  not  their  direction.  Finally,  the  flat  spots 
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in  the  neighborhood  of  the  data  points  is  somewhat  disturbing. 
The  first  of  these  objections  can  be  met  by  defining  a local 
version  of  the  formula,  which  we  shall  do  in  section  4.5.  It  is 
possible  to  construct  an  analogous  formula  which  accounts  for 
direction.  For  details,  see  Shepard  [180].  Finally,  we  briefly 
discuss  handling  the  flat  spots. 

Suppose  in  addition  to  the  function  values  F^  at  each 


point  (xi,yi)  we  also  have  estimates 


FX.  and  FY. 
1 i 


of 


F (x.,y. ) and  F (x.,y.).  Then  we  may  consider  the  function 
x i"i  y l i 


N 


(3.13)  f(x,y)  = l,  (x,  y)  [F^  + (x-x^FX.^  + (y-yjFY^. 


It  is  easily  checked  that  this  function  also  interpolates,  and 
that 

(3.14)  fx(xi,yi)  = FXi>  fy(Xi,yi)  = FV  1 = 

This  property  may  be  expressed  in  the  assertion  that  if  the 
data  F^*FX^,FY^  came  from  a plane  surface  F,  then  f will 
exactly  reproduce  this  surface.  To  use  formula  (3.13)  in  prac- 
tice on  the  data-fitting  Problem  1.1,  we  have  to  carry  out  a 
two-stage  approximation  process  in  which  the  first  stage  con- 
sists of  some  method  for  estimating  the  slope  at  each  of  the 
data  points. 

It  might  be  of  practical  interest  in  some  cases  to  con- 
struct still  a more  sophisticated  version  of  Shepard's  formula 
which  would  exactly  reproduce  higher-order  polynomial  surfaces. 
One  approach  to  doing  this  is  to  use  the  following  lemma. 

LEMMA  3.1.  (Barnhill  [15]).  Let  P and  Q be  linear  projec- 
tions of  some  linear  space  of  functions  9 into  itself.  Sup- 
pose that  Q exactly  reproduces  the  linear  subspace  E c:  9; 
i.e. . 

(3.15)  Qp  = p,  all  p e E. 
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In  addition,  suppose  that  [A.  } ^ is  a set  of  linear  functionals 
on  9,  and  that 

(3.16)  A^f  = A^f,  all  f e 9,  i = 1,2,  . ..,m. 

Then  the  Boolean  sum  projector 

(3.17)  P ® Q = P + Q - PQ 

enjoys  the  function  precision  of  Q (i.e.,  reproduces  E)  and 
the  interpolation  properties  of  P (i.e.,  (3.16)  also  holds 
for  P ® Q)  . 

This  result  permits  the  construction  of  interpolation 
schemes  using  Shepard's  formula  which  reproduce  higher-order 
surfaces.  For  an  example,  see  Poeppelmeir  [155]  where  Shepard's 
formula  is  combined  with  a certain  local  interpolation  scheme 
which  reproduces  quadratic  surfaces.  In  closing  this  section 
we  note  that  Shepard's  formula  can  also  be  interpreted  as  aris- 
ing from  weighted  least  squares--see  section  5.1. 

3.4  Spline  interpolation  (scattered  data) . Suppose  X is  a 
linear  space  of  "smooth"  functions  defined  on  the  domain  D, 
and  let 

(3.18)  U = [f  e X:  f^y^  = F£,  i = 1,2, 

U is  the  set  of  smooth  functions  which  interpolate.  Now  sup- 
pose that  0 is  a functional  on  X which  measures  the  smooth- 
ness of  an  element  in  X--the  smaller  0(f)  is,  the  smoother 
f is.  Then  we  may  consider  the  following  minimization  problem: 

(3.19)  Find  s e U such  that  0(s)  = inf  0(u) . 

ueU 

The  function  s will  be  the  smoothest  interpolant,  and  in  view 
of  the  similarity  with  classical  spline  approximation,  s is 
called  a spline  function  interpolating  F.  The  basic  questions 
concerning  spline  interpolation  center  around  existence,  unique- 
ness, characterization,  and  construction.  A quite  general 
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abstract  theory  of  spline  interpolation  has  been  built  up  (see 
eg.  Laurent  [127]  and  references  therein) . In  this  section  we 
quote  some  specific  examples  which  can  be  used  on  Problem  1.1. 

Where  X is  a semi-Hilbert  space,  0(f)  = ||f||,  where  ||*|| 
is  a seminorm  on  X,  and  N = [f  e X:  ||f||  = 0},  it  is  possible 
to  show  (under  some  additional  mild  conditions  on  X,  see  Duchon 
[72,  73])  that  problem  (3.19)  always  has  a solution  which  is 
unique  up  to  an  element  in  N.  Moreover,  it  can  be  shown  that 
there  exists  a reproducing  kernel  K defined  on  DxD  such 
that 

N d 

(3.20)  s (x,  y)  = Z a K((x,y)  ; (x  ,y  ))  + Z b p (x,  y) , 
i=l  1 i=l 


r id 

where  l p ^ J ^ is  a basis  for  N.  Moreover,  the  coefficients 
[a^]  and  [b^]  can  be  determined  from  the  linear  system  of 
equations 


N 


(3.21)  ZiK((xj,yj);(xi,yi))at  + Z btPi(x.,y.)  = F.,  j=l,...,N 


Z a*Mx<'yP  = °>  k = 1,  2,  . . . , d . 

i=l  1 K 1 1 


The  development  with  semi-Hilbert  spaces  in  Duchon  [72,73] 
is  an  extension  of  earlier  work  of  Atteia  [10-12]  and  Thomann 
[192-193]  using  Hilbert  spaces.  The  essential  difficulty  in 
applying  the  general  results  is  the  construction  of  an  appro- 
priate reproducing  kernel.  We  turn  now  to  some  specific  exam- 
ples. 

Suppose  X is  the  space  of  all  functions  on  the  rectangle 
D = H (cf.  (3.4))  which  have  (distributional)  derivatives  up  to 
order  2 which  lie  in  L2(H).  For  f e X,  let 

(3.22)  0(f)  = //  | D 2 f | 2 + 2|D  D f |2  + | D^f  |2. 

D x x y y 
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The  reproducing  kernel  in  this  case  can  be  written  down  as  an 

infinite  series  involving  sin  and  cos,  and  the  space  N is 

spanned  by  1,  x,  and  y.  Similarly,  if  we  replace  H by  the 

unit  disc  UD,  the  kernel  can  be  computed  as  an  infinite  series 

(see  Atteia  [10-12]  and  Thomann  [192-193]).  Thomann  considers 

computation  of  these  splines  by  approximating  the  infinite 

series--FORTRAN  programs  are  also  included. 

If  we  replace  the  bounded  sets  H or  UD  by  the  entire 
2 

plane  R and  introduce  an  appropriate  space  X,  it  is  possible 

to  obtain  explicit  expressions  for  the  reproducing  kernel.  This 

is  the  content  of  Duchon  [72,73].  In  particular,  let  Hs 

2 

be  the  set  of  all  tempered  distributions  f on  R whose 
Fourier  transforms  f satisfy  /|f|t^sdt  < oo.  Let  Xms  de- 
note the  set  of  all  functions  which  have  derivatives  up  to 

order  m lying  in  H . Our  first  example  concerns  the  space 
20 

X . If  we  choose  0 as  in  (3.22),  then  the  interpolating 
spline  solution  of  (3.19)  is  of  the  form 
N 2 

(3.23)  s (x,  y)  = Z airi(x,y)  log  (r£(x,y))  + bjX  + b2y  + b3, 

where  r^(x, y)  = [(x-x^)  + (y-y^)  ] 2.  The  coefficients  are  de- 
termined from  the  system  (3.21)  with  d = 3,  N = span  [l,x,y], 
and  K(z,w)  = |z-w  | log(z-w|.  Duchon  refers  to  this  type  of 
spline  as  a thin  plate  spline  since  the  expression  0 relates 

to  the  energy  in  a thin  plate  forced  to  interpolate  the  data. 

2 

This  spline  belongs  to  C(R  ) . 

21 

As  a second  example,  suppose  we  consider  X = X . In 
this  case  the  solution  of  (3.19)  with  0 given  by  (3.22)  has 
the  form 

N 

(3.24)  s (x,  y)  = Z a (r  (x,y))  + b x + b y + b . 

i=l  1 1 

Here  K(z,w)  = |z-w|  . Duchon  [72,  73]  refers  to  these  splines 
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as  pseudo-cubic  splines  because  of  the  analogy  with  the  cubic 
splines  in  one  variable.  They  belong  to  C^(R).  Pseudo  quintic 
splines  etc.  are  also  considered  in  Duchon  [72,73]. 

A similar  program  has  been  carried  out  by  Mansfield  [133- 
137]  for  some  spaces  of  smooth  functions  defined  on  a rectangle 
H.  In  [136  ] she  considers  a space  of  functions  Tm,n(a,p), 
where  m and  n are  positive  integers  and  a<a<b,  c<f3<d. 
This  space  is  actually  defined  by  completion  of  a set  of  tensor 
product  functions  with  respect  to  an  appropriate  inner-product, 
and  we  do  not  want  to  define  it  precisely  here.  A function 
f e Tm^  1 (a, P)  has  the  following  properties,  however: 

r 


(3.25)  < 


(i,j) 

e C 

(H),  i = 0 

,1, 

• • • j m- 1 

and  j =0,1, 

• • • y n 

(s- j- 

l,j> 

(x,  P)  eAC[a 

,b] 

and  f^S 

j)  (x,(3)  eL2 

[a,  b] 

j = 0,1,.. 

• , n*  1 

(i,  s- 

i-1) 

(a,y)  e AC  [c 

,d] 

and  f^# 

S-i)(a,y)  eL2 

[c,d] 

i = 0,1,.. 

. ,m- 1 

(m-1. 

n-1) 

e AC(H)  and 

f(m,n)  £ 

l2(h), 

AC 

stands  for  the 

space  of  absolutely  continuous 

tions  and  where  s = m+n.  By  constructing  an  appropriate  re- 
producing kernel,  she  is  able  to  solve  problem  (3.19)  with 

n-1  b 


(3.26)  9(f)  = //[  f 

H 


(m,n)]2+  £ j [f(s- j,  j)  (^p)  ]2dx 
j=0  a 

Sz  /‘V-^Wnv 

i=0  c 


In  [133],  Mansfield  carries  out  a similar  analysis  for  a 
space  of  functions  Rm’ n defined  on  the  rectangle  H.  Here 
Rm,n  = L2  [&>  b]  x L^tcjd],  where  L^^b]  is  the  usual  Sobolev 
space  of  functions  with  absolutely  continuous  derivatives  up  to 
order  m-1,  and  with  f ^ e L2[a,b].  By  constructing  an 
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appropriate  reproducing  kernel,  she  now  solves  problem  (3.19) 
with 

(3.27)  0(f)  = //  [f(m'n)]2+  Z / [f(m,j)(x,c)]2dx 

H j=0  a 

m- 1 d . , \ rs 

+ Z / [f  ,n^  (a,y)  ] dy  . 
i=0  c 

The  solution  turns  out  to  be  a piecewise  polynomial  of  degree 

2m- 1 in  x and  of  degree  2n-l  in  y.  It  is  also  in 
9m—  9 9 n _ 9 

C * (H) . For  the  particular  case  of  gridded  data,  it  re- 

duces to  the  tensor  product  of  one-variable  splines  (cf.  the 
following  section) . Other  more  general  definitions  of  0 are 
also  considered  (with  minor  modifications  on  the  one-dimensional 
integrals) . 

A more  algebraic  approach  to  constructing  multidimensional 
spline  functions  (which  also  involves  certain  kernel  functions) 
has  been  taken  by  Schaback  [173-174],  His  two-dimensional  ker- 
nel function  is  obtained  as  a tensor  product  of  one-dimensional 
kernels. 

3.5.  Spline  interpolation  (gridded  data) . The  problem  of  con- 
structing interpolating  splines  in  two  dimensions  with  gridded 
data  as  in  (3.4) -(3. 6)  is,  of  course,  a special  case  of  the 
general  problems  discussed  in  subsection  3.4.  The  development 
of  the  gridded  data  case  predated  the  more  general  development 
and,  moreover,  is  considerably  simpler.  There  are  a great  many 
papers  on  two-dimensional  polynomial  splines  and  generaliza- 
tions. We  do  not  have  space  here  to  discuss  all  of  them  in  de- 
tail. We  shall  be  content  to  quote  some  of  the  papers  and  to 
give  a somewhat  more  complete  discussion  of  polynomial  splines, 
which  are  the  most  widely  used  splines  for  this  problem. 

Some  early  papers  dealing  with  two-dimensional  interpola- 
ting splines  include  Birkhoff  and  de  Boor  [26],  Birkhoff  and 
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Garabedian  [21],  Price  and  Simonson  [159],  and  Theilheimer  and 
Starkweather  [191].  In  [26]  certain  bicubic  splines  were  intro- 
duced which  were  later  studied  in  detail  in  de  Boor  [32].  The 
problem  was  to  minimize 

(3.28)  / f [f(2’2)  (x,y)]2dxdy 
a c 

over  all  appropriately  smooth  functions  on  the  rectangle  H 

which  interpolate  the  gridded  data  (3.4) -(3.6).  It  was  found 

that  the  solution  of  this  problem  was  a certain  bicubic  func- 

2 

tion  with  global  smoothness  C (H) . This  problem  was  genera- 
lized to  minimizing 

bd  , . ? 

(3.29)  9(f)  = f f [f^n;  (x,y)]cixdy  , m = 2p,  n = 2q 

a c 

in  Ahlberg,  Nilson  and  Walsh  [1,2],  whose  solution  involves 
certain  higher-order  polynomial  splines.  Since  they  are  widely 
used,  we  give  a short  algebraic  treatment  here. 

The  points  and  (y^^  de^ine  a partition  of  the 

intervals  [a,b]  and  [c,d],  respectively  (cf.  (3.5;).  Suppose 
now  that  x,  <...<x,<a<b<x,  _<...<  x.  . and 

y1-n  < •••<y_1<c<d<  y£+2  - •••  - yi+n-iare  chosen  arbi_ 

trarily.  Let  be  tbe  B-splines  associated  with  the 

x-partition,  and  let  the  B-splines  associated  with  the  y-parti- 

tion  be  denoted  by  (N^(y)]f  . For  a complete  discussion  of 

J 1-n 

B-splines  and  their  properties,  see  de  Boor  [36]  in  this  volume 
(or  [33]).  Let 

(3.30)  N^j(x,  y)  = N“(x)N^(y),  i = 1-m,  ...,k  and  j = 1-n, 

The  linear  space 

(3.31)  S.  .pan  <*.y)  J=l-n 

is  clearly  of  dimension  (k+m) (i+n) . We  may  now  construct  an 
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element  in  (3.31)  which  interpolates  to  the  gridded  data. 

Since  there  are  only  (k+2)(i+2)  data  points  on  the  grid  (cf. 
(3.4) -(3.6),  it  is  clear  that  if  we  use  S to  interpolate,  we 
have 


(3.32)  (k+m)  (i+n)  - (k+2)  (-2+2)  = (k+2)  (n-2)  +(4+2)  (tn-2)  +(n-2)  (m-2) 


free  parameters.  Thus,  to  uniquely  define  a spline,  one  must 
add  additional  conditions.  Recall  that  m = 2p  and  n=2q.  Then 
we  might  add  the  extra  conditions 


(3.33) 


s<V’°><Vyj)  = s<V’°)<xk+i’yj)  =0> 

s<0’ll)  ( x ± , yQ ) = s<0,'->  (>=1,yi+1)  '0, 


j =0,1, . . .,i+l 
V = p,  . . . , m-  2 

i =0,  1,  . . . ,k+l 
U = q,  . . .,n-2 


and 


(3.34) 


S'V,H)  <VV  " s<V,ll)  <VW  - s<V,tl)  <*k+i’ V 


s<V’t‘)<’tk+l’yi+l)  = 0’  V=p,...,m-1 

M-  — ^1;  • • • y J.  • 


These  are  called  the  natural  boundary  conditions,  and  it  can  be 
shown  that  the  system  of  equations 


(3.35) 


a = 0,  l, . . ,,k+l 
p = 0, 1, . . .,i+l 


coupled  with  the  conditions  (3 .33) - (3.34)  provides  a nonsingular 
system  of  equations  for  the  coefficients  {a„}.  This  system 
has  convenient  bandedness  properties  if  the  equations  are  ar- 
ranged properly.  The  resulting  spline  is  precisely  the  solution 
of  the  minimization  problem  (3.29).  The  boundary  conditions 
(3 ,33) - (3 .34)  are  the  natural  ones  associated  with  the  problem 
(3.29).  However,  it  is  also  possible  to  specify  lower-order 
derivative  information  along  the  boundary  and  also  obtain  a 
nonsingular  system  of  equations.  The  resulting  spline,  called 
Type  I,  can  also  be  shown  to  satisfy  an  appropriate  minimization 
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problem.  However,  for  data- fitting  purposes,  to  use  the  inter- 
polant  with  boundary  derivative  data  one  would  first  have  to 
perform  a first-stage  approximation  to  find  estimates  for  the 
required  derivatives. 

The  best-known  case  of  the  above  spline  interpolation  is 

the  case  m = n = 4,  i.e.,  bicubic  spline  interpolation.  In 

this  case  the  surface  constructed  is  a piecewise  bicubic  with 

2 

global  smoothness  C (H) . The  natural  boundary  conditions  set 
second-derivative  values  to  0.  Programs  for  computing  natural 
bicubic  interpolating  splines  can  be  found  in  the  IMSL  Library 
[117]  in  FORTRAN.  FORTRAN  programs  for  Type  I bicubic  splines 
can  be  found  in  Koelling  and  Whitten  [121],  where  the  required 
boundary  information  is  assumed  to  be  input.  An  ALGOL  program 
for  computing  Type  I bicubic  splines  in  which  boundary  data  are 
automatically  computed  by  fitting  one-dimensional  splines  ap- 
pears in  Spath  [183]. 

Bicubic  spline  interpolation  has  been  widely  applied.  For 
some  references  in  the  Geology  literature,  see  eg.  Anderson  [7], 
Bhattacharyya  [22],  Holroyd  and  Bhattacharyya  [115],  Koelling 
and  Whitten  [121],  and  Whitten  and  Koelling  [206], 

Problem  (3.29)  has  been  widely  generalized  in  the  spline 
literature.  Instead  of  minimizing  ordinary  derivatives,  one 
may  introduce  general  linear  operators,  and  instead  of  dealing 
with  point  evaluation  functionals,  more  general  linear  function- 
als may  be  permitted.  To  list  some  (but  by  no  means  all)  papers 
dealing  with  such  generalizations,  we  mention  Arthur  [8,9], 
Birkhoff,  Schultz  and  Varga  [29],  de  Boor  [34],  Delvos  [65,66], 
Delvos  and  Schempp  [68,69],  Delvos  and  Schlosser  [70],  Fisher 
and  Jerome  [78,79],  Haussmann  [100],  Haussmann  and  Munch  [104], 
Munteanu  [143,144],  Nielson  [148,150],  Ritter  [164,165],  Sard 
[171,172],  Schoenberg  [176],  Schultz  [177,178],  Spath  [184,185], 
and  Zavialov  [209-212].  On  L-shaped  regions  and  other  polygons 
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see  Birkhoff  [25]  and  Carlson  and  Hall  [44-49]. 

We  close  this  section  by  mentioning  another  direction  of 
generalization  which  has  led  to  a considerable  development,  the 
idea  of  spline  blending.  These  methods  are  useful  for  construc- 
tion of  a surface  which  interpolates  not  only  function  values 
at  isolated  points  but  on  the  grid  lines  themselves;  i.e., 


f(x,y_j) 

= F (x,  y j) 

a < 

x < b 

and  j = 0, 1, . . . , i+1 

f (x^y) 

= F(xt,y) 

c s 

y < d 

and  i = 0,1,...,  k+1 . 

To  use  such  blending  methods  one  must  have  F defined  on  the 
grid  lines.  Thus,  the  methods  could  be  of  value  as  second-stage 
processes.  We  do  not  have  space  to  go  into  detail  on  spline- 
blended  methods.  We  refer  to  the  recent  book  of  Barnhill  and 
Riesenfeld  [20]  for  a collection  of  papers  on  the  subject  and 
for  further  references.  See  also  the  papers  of  Gordon  [84-87] 
and  Gordon  and  Hall  [88].  Recently,  considerable  effort  has 
gone  into  showing  that  blending  methods  also  arise  as  solutions 
of  appropriate  variational  problems;  see  the  papers  of  Delvos 
[65],  Delvos  and  Rosters  [66],  and  Delvos  and  Malinka  [67]. 

4 . Local  interpolation  methods 

The  interpolation  methods  discussed  in  section  3 were  glo- 
bal in  nature-- that  is,  the  value  f(x, y)  of  the  constructed 
surface  at  any  given  point  (x, y)  in  D depends  on  the  values 
of  all  of  the  data  points.  This  generally  means  that  to  compute 
a representation  for  f one  has  to  solve  a fairly  large  system 
of  equations,  and  to  evaluate  f(x,y)  one  generally  has  to 
carry  out  a considerable  amount  of  arithmetic.  In  this  section 
we  shall  consider  local  schemes  where  the  surface  depends  only 
on  nearby  data  points.  Then  the  construction  will  usually  lead 
to  (a  possibly  large  number)  of  small  systems  of  equations,  and 
moreover,  the  evaluation  of  the  surface  at  a given  point  will 


222 


FITTING  SURFACES  TO  SCATTERED  DATA 


usually  involve  very  little  computation. 

Many  of  the  schemes  mentioned  in  section  3 can  be  made  lo- 
cal in  nature  by  the  following  simple  approach.  Suppose  that 

d 

the  domain  D is  partitioned  into  subdomains:  D = U D . We 

i=l  1 

then  seek  a surface  in  the  form 

(4.1)  f(x,y)  = (fi(x,y),  (x,y)  e i = l,2,...,d. 

To  construct  each  individual  f , we  suppose  that  are  do- 

mains containing  D^,  which  contain  only  points  which  are  "near" 
. Then  we  use  the  data  (and  only  the  data)  in  to  con- 

struct  fj,.  Usually,  we  can  choose  = D^.  In  most  cases 
the  most  convenient  choices  for  the  subdomains  are  trian- 

gles and  rectangles.  We  discuss  these  two  cases  first. 

4.1.  Triangular  subregions  (scattered  data) . Suppose  that  we 
are  given  data  at  points  = (x^,y^),  i = 1,2,  . ,.,N  scatter- 
ed throughout  the  plane,  and  let  D be  the  convex  hull  of 
these  points.  It  is  more  or  less  clear  that  by  drawing  lines 
from  point  to  point  we  can  construct  a set  of  triangles  with 
vertices  at  the  which  partition  D.  It  is  also  clear  that 

given  any  set  of  points,  this  triangularization  of  D is  not 
usually  uniquely  defined  (see  Figure  2 below  for  two  different 
triangularizations  of  the  same  region) . Moreover,  as  the  fig- 
ure shows,  some  triangularizations  are  superior  to  others  in 
the  sense  that  they  exhibit  fewer  of  the  less  desirable  long 
thin  triangles. 

<€> 

Figure  2.  Triangularization 
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The  design  of  an  algorithm  to  divide  a region  into  accept- 
able triangles  with  vertices  at  prescribed  points  is  not  as 
easy  as  it  sounds.  Two  algorithms  in  the  literature  which  are 
designed  to  give  good  triangularizations  can  be  found  in  Caven- 
dish [50]  and  in  Lawson  [128]. 

The  simplest  approach  to  defining  a local  interpolating 
surface  is  to  construct  f ^ (x,  y)  to  be  of  the  form  a^  + a^x  + 
a^y  in  each  triangle.  The  data  at  the  three  corners  of  the 
triangle  determine  the  coefficients  for  that  piece  of  f (the 
corresponding  system  will  be  nonsingular  provided  the  triangle 
is  nondegenerate) . This  procedure  produces  a piecewise  linear 
surface  which,  in  fact,  will  be  globally  continuous.  This  last 
property  follows  from  the  fact  that  along  the  sides  of  the  tri- 
angle the  functions  reduce  to  straight  lines  joining  the  ver- 
tices. This  method  has  been  used  by  several  authors  for  data 
fitting,  e.g.,  Lawson  [128]  and  Whitten  [206],  For  some  con- 
touring routines  based  on  this  local  interpolation  scheme,  see 
section  8. 

If  we  desire  to  interpolate  several  sets  of  data  defined 
on  the  same  triangularization,  it  may  be  more  convenient  to 
compute  Lagrangian  functions  rather  than  to  compute  the  surface 
in  each  triangle  separately.  In  particular,  it  is  clear  that 
we  can  construct  functions  [0 ^ (x, y)  with  the  property 

(4.2)  0j(xi,yi)  = 8 , i,  j = 1,2,  ...,N. 


These  functions  can  be  constructed  as  pyramids  in  such  a 


way  that  the  function 
surrounding  the  point 


has  support  only  on  the  triangles 
(Xj,yj)  (see  Figure  3).  In  terms  of 


these  Lagrangian  functions,  the  interpolating  surface  is  given 


(4.3)  f(x,y)  = ZF0  (x,y). 

j = l J J 
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Figure  3.  A Lagrange  Element 

The  Lagrangian  approach  to  local  interpolation  is  very 
reminiscent  of  the  finite  element  method  in  which  the  solution 
of  an  operator  equation  is  sought  in  the  form  of  a linear  com- 
bination of  a set  of  functions  (called  elements)  with  the  pro- 
perty (4.2).  (See  e.g.,  Prenter  [157],  Schultz  [179],  or 
Strang  and  Fix  [188].)  There  is  no  need  to  restrict  the  ele- 
ments to  be  piecewise  linear  functions--we  may  use  higher-order 
polynomials,  rational  functions,  or  even  more  complicated  func- 
tions. In  fact,  if  we  are  careful  in  the  construction,  we  may 
be  able  to  construct  elements  with  small  support  but  higher 
global  smoothness. 

There  are  a great  many  papers  in  the  finite-element  litera- 
ture concerned  with  defining  convenient  smooth  elements  (La- 
grangian functions  with  small  support) . To  mention  a few,  see 
Barnhill,  Birkhoff,  and  Gordon  [16],  Barnhill  and  Gregory  [17, 

18],  Barnhill  and  Mansfield  [19],  Birkhoff  and  Mansfield  [28], 
Bramble  and  Zlamal  [39],  Goel  [83],  Hall  [94],  Mitchell  [141], 
Mitchell  and  Phillips  [142],  Nicolaidis  [146,147],  Zenisek  [213], 
Zienkowicz  [214],  and  Zlamal  [215-217].  The  books  on  finite 
elements  of  Aziz  [13],  de  Boor  [35],  Strang  and  Fix  [188],  and 
Whiteman  [198]  should  also  be  consulted. 
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The  construction  of  elements  with  higher-order  smoothness 
becomes  increasingly  difficult.  For  example,  it  is  shown  in 
Mansfield  [137]  that  to  get  an  element  with  support  on  the  tri- 
angles surrounding  P and  with  global  continuity  C (D) , it 

is  necessary  to  use  polynomials  of  degree  5 at  least.  (Matters 
are  somewhat  simpler  on  regular  triangularizations,  see  subsec- 
tion 4.2  below.) 

We  close  this  subsection  by  mentioning  that  it  is  also  pos- 
sible to  perform  interpolation  using  elements  based  on  triangles 
to  data  which  also  involves  derivatives,  or  in  analogy  with  the 
blending  methods,  to  data  which  includes  function  values  along 
the  edges  of  the  triangles.  (See  e.g.,  Barnhill,  Birkhoff,  and 
Gordon  [16],  or  Barnhill  and  Gregory  [17,18].)  These  methods 
are  not  directly  applicable  to  the  scattered  data  Problem  1.1, 
but  may  be  useful  as  second-stage  methods. 

4.2.  Regular  triangularizations.  When  the  data  is  distributed 
such  that  the  region  can  be  triangulated  into  a set  of  congru- 
ent triangles,  then  it  is  extremely  advantageous  to  use  the  La- 
grange approach.  In  particular,  in  this  case  we  can  find  an 
element  0 with  value  1 at  (0,0)  such  that  all  other  elements 
are  translates  of  0.  In  this  case,  f takes  the  form 
N 

(4.4)  f(x,y)  = E F 0((x,y)  - (x  ,y  ))  . 

j=l  J J J 

We  illustrate  this  with  a couple  of  examples.  Suppose  that 
we  are  given  data  at  points  chosen  from  the  collection 

(4.5)  ^ = {(i,j))i  jeZ  U { (i  + k,  j + ^)  jeZ  , Z = (integers). 

These  points  lie  on  the  comers  of  a triangular  grid  as  shown 
in  Figure  4. 

It  is  shown  in  Zwart  [218, p.673]  that  there  exists  a func- 
1 2 

tion  0 e C (R  ) which  is  1 at  the  origin  and  0 at  all  other 
points  in  0,  and  has  support  on  the  shaded  region  in  Figure  4. 
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Figure  4.  A Regular  Triangularization 


This  function  is  constructed  as  a piecewise  quadratic  polyno- 
mial. A similar  element  has  been  constructed  by  Powell  [156] 
(the  figure  on  page  267  of  [156]  should  be  rotated  45°  to  see 
this)  . 

To  give  another  example,  suppose  that  we  consider  the  set 
of  points  fi2  which  lie  at  the  vertices  of  the  grid  defined  by 
equilateral  triangles  shown  in  Figure  5. 


Figure  5.  Another  Regular  Triangularization 
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It  is  shown  in  Fredrickson  [81]  that  there  exists  a function  0 

which  has  value  1 at  the  origin  and  value  0 at  all  other  points 

2 2 

in  . The  function  0 is  in  C (R  ) , consists  of  piecewise 

quartics,  and  has  support  in  the  region  shown  in  Figure  5. 

Fredrickson  also  constructs  a piecewise  cubic  element  with  the 

1 2 

same  support  but  which  is  only  C (R  ) . For  right  triangles 
see  Carlson  and  Hall  [44], 


4.3.  Rectangular  subregions.  In  this  section  we  suppose  that 

we  have  data  given  at  points  lying  on  a rectangular  grid  as  in 

(3.4) -(3. 6),  and  consider  local  interpolation  methods.  The 

simplest  approach  here  (cf.  the  triangularization  case)  is  to 

construct  a separate  bilinear  function  f (x,  y)  = a^  + a^x  + 

a_y  + a. xy  in  each  subrec tangle,  H.  . = [x.,x.  ,]x[y.,y. 

3 4 ° 7 ij  V i + 1 j J+l 

using  the  four  comer  values  to  determine  the  coefficients. 

Since  the  bilinear  patches  reduce  to  linear  functions  on  the 

grid  lines,  the  global  surface  is  C(R) . 

Several  authors  have  considered  constructing  functions  on 


each  of  the  H. . 

tj 


using  higher-order  polynomials. 


This  requires 


additional  information  in  addition  to  the  four  corner  values. 


For  example,  if  one  seeks  a bicubic 


(4.6)  f (x,  y)  = Z / a x1yJ, 
i=0  j=0  -1 

there  are  16  coefficients  to  determine.  These  could  be  deter- 
mined by  the  four  comer  values,  plus  the  values  of  f^,  f^, 

and  f at  each  corner.  To  determine  these,  one  must  perform 
xy  ’ 

some  first-stage  process.  For  some  approaches  to  this,  see 
Akima  [5],  Hessing,  et  al  [114],  and  Shu,  et  al  [181].  A FOR- 
TRAN program  for  Akima's  method  can  be  found  in  [6],  Nonpoly- 
nomial patches  have  also  been  considered;  e.g.,  see  Birkhoff 
and  Garabedian  [27], 

The  Lagrange  (finite  element)  approach  can  also  be  used  in 
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the  case  of  rectangular  gridded  data.  In  particular,  if  we  can 
construct  a function  satisfying  (4.2)  with  local  support,  then 
the  surface  f given  by  (4.3)  will  interpolate  and  the  method 
will  be  local  in  character.  As  before,  the  Lagrange  approach 
is  especially  convenient  if  the  grid  is  regular,  i.e.,  if  all 
subrectangles  . are  congruent.  To  illustrate  this,  suppose 
that  the  H are  actually  the  unit  squares;  i.e.,  the  data 
points  lie  in  the  set 


(4.7)  = { (i, j)  } i,j  e Z,  Z = {integers}. 

To  get  a quadratic  C*  element,  we  may  simply  rotate  the  ele- 
ment of  Zwart  [218]  considered  in  the  last  section  by  45  degrees 
(cf.  Figure  4),  or  we  may  take  the  element  of  Powell  [156], 


4.4.  Parametric  representations.  The  methods  discussed  in  the 
last  section  is  concerned  with  data  given  on  a rectangular  grid. 
By  using  parametric  representations,  it  is  possible  to  construct 
similar  local  interpolating  surfaces  for  data  given  at  the  cor- 
ners of  any  partition  of  D consisting  of  quadrilaterals.  In 
this  section  we  briefly  describe  how  this  might  proceed. 

Suppose  Q is  a particular  quadrilateral  subregion  of  D 
of  interest.  In  addition,  suppose  that  x(s,t),  y(s,t),  and 
z(s,t)  are  functions  defined  on  the  unit  square  U [0,1]  x [0,1] 
with  the  properties  that  as  (s, t)  runs  over  the  boundary  of 
U,  (x (s, t) , y (s, t) ) runs  over  the  boundary  of  the  quadrilateral; 
the  four  corners  of  U correspond  to  the  four  corners  of  Q; 
and  z(s,t)  takes  on  the  desired  data  values  at  the  four  cor- 
ners of  U.  In  this  case,  the  triple  (x(s, t) , y (s, t) , z (s, t) ) 
provides  a parametric  representation  of  a piece  of  surface  de- 
fined over  Q interpolating  the  data. 

The  problem  of  constructing  parametric  representations  of 
interpolating  functions  has  been  considered  in  a number  of  pa- 
pers. Several  papers  on  these  methods  and  a host  of  references 
can  be  found  in  the  book  of  Barnhill  and  Riesenfeld  [20];  see 
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also  the  survey  paper  of  Shu  et  al.  [181].  Such  surfaces  are 
sometimes  called  Coon's  surfaces,  cf.  Coons  [59],  and  are  of 
considerable  interest  in  the  field  of  computer-aided  geometric 
design.  To  mention  just  a few  of  the  actual  papers,  see  Ahuja 
and  Coons  [4],  Eamshaw  and  Youille  [74],  Ferguson  [77],  Hayes 
[107],  Hosaka  [116],  and  Mangeron  [132], 

There  also  has  been  some  effort  directed  towards  construct- 
ing elements  (Lagrange  functions)  associated  with  other  less 
regular  subsets  of  the  plane.  We  mention,  for  example,  the 
work  of  Ciarlet  and  Raviart  [55],  Wachspress  [194,195],  and 
Zlamal  [217]  in  which  elements  are  constructed  for  domains 
involving  curved  edges. 


4.5.  Local  Shepard  methods.  It  is  possible  to  modify  the  meth- 
od discussed  in  subsection  3.3  to  make  it  local.  For  example, 
following  Shepard  [180],  suppose  we  fix  0 < R and  define 


1/r 


0 < r < f , 


(4.8)  \|» (r)  = ^ R/3  < r < R, 


R < r 


This  function  is  continuously  differentiable  and  vanishes  iden- 
‘ R. 

r 


tically  for  r > R.  Now  with  r^  as  in  (3.8),  we  define 


(4.9)  f(x,y)  = < 


£ F [i|r(r  )]M 

i=l  1 1 

N 

Z Nr(r.)]^ 
i=l  X 


, when  r^^O,  all  i 


, when  r^  = 0. 


2 

Formula  (4.9)  is  defined  at  all  (x,y)  in  the  plane  R . 
By  definition  it  interpolates  the  values  F^  at  the  data 
points  (x^,y^),  i = 1,2, ...,N.  The  values  at  non-data  points 
are  obtained  as  weighted  averages  of  the  data  values  F^,  but 
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only  those  which  lie  at  points  within  a distance  of  R of 
(x,y).  Thus,  the  formula  is  local. 

To  use  this  method  in  practice  it  is  necessary  to  choose 
a reasonable  value  for  R.  The  aim  is  to  find  R so  that  for 
every  (x, y)  a reasonable  number  of  data  points  will  fall  in 
the  disk  centered  at  (x,y)  of  radius  R . It  would  also  be 
possible  to  let  R depend  on  (x,y),  i.e.,  to  use  different 
values  of  R in  different  subregions  of  D. 

5 . Global  approximation 

As  mentioned  in  the  introduction,  frequently  the  data  does 
not  warrant  constructing  an  interpolating  function  (e.g.,  be- 
cause of  errors) . In  such  cases  it  may  be  preferable  to  con- 
struct a surface  which  only  approximates  the  data.  In  this  sec 
tion  we  discuss  some  global  approximation  methods. 


5.1.  Polynomial  least  squares.  The  general  theory  of  discrete 
least-squares  fitting  is  very  well  known.  To  briefly  review, 
suppose  that  (0^)^  are  n given  functions  on  D.  Define 


(5.1)  «(a) 


N n 

Z Za0(x,y)-F 

i=l  j=l  J J 1 1 1 


T n 

where  a = (a^, ...,3^)  is  any  vector  in  R . 

lem  is  to  find  a*  such  that 


Then  the  prob- 


(5.2)  <K(a*)  = min  $>(a)  . 

a 

The  corresponding  function 
n 

(5.3)  f(x,y)  = Z a*  <t  (x,y) 

j=l  J J 

is  called  the  discrete  least-squares  approximation  of  the  data 
( Ff } i . Usually  one  takes  n considerably  smaller  than  N.  In 
this  section  we  briefly  discuss  least  squares  using  polynomials 
Before  doing  so,  however,  we  make  a few  general  remarks  about 
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solving  the  general  least-squares  problem. 

There  are  several  approaches  to  solving  (5.2).  Perhaps 
the  neatest  is  the  case  where  the  {0^}” 
respect  to  the  inner-product 


are  orthonormal  with 


N 

(5.4)  (0,t)  = Z 0(x  ,y  )t  (x  ,y  ) . 

i=l  1 1 


Then  the  solution  of  (5.2)  can  be  written  down  explicitly  as 
n 


(5.5)  f(x,y)  = Z F 0 (x,y)  . 

j=l  J J 


A second  very  well-known  approach  to  solving  (5.2)  is  via 
the  normal  equations 


(5 . 6)  A*A  a = A*F  , 

T 

where  F = (F^,  ...,F  ) is  the  vector  of  data  values,  and  where 


(5.7) 


A = 


(VvV) 


n , N 
j=l,i=l* 


In  some  cases  the  normal  equations  are  a perfectly  acceptable 
way  to  compute  least- squares  approximation,  but  in  other  cases 
the  system  (5.6)  may  be  ill-conditioned  (or  even  singular--cf . 
the  following  subsection  for  spline  least  squares) . This  ap- 
proach is  also  not  convenient  should  side  conditions  be  desired 
(e.g.,  by  imposing  actual  interpolation  at  some  of  the  values). 
For  more  on  the  normal  equations,  see  any  book  on  Numerical 
Analysis . 

A more  modem  method  of  solving  least-squares  problems  is 
to  use  general  matrix  methods.  Specifically,  consider  the  ob- 
servation equations 

(5.8)  A a = F. 

It  can  be  shown  that  by  applying  a series  of  matrix  transfor- 
mations to  this  system,  one  can  obtain  a set  of  equations  giving 
the  vector  a*.  For  a complete  description  of  methods  of  this 
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type  see  Lawson  and  Hanson  [ 1 29 J or  Stewart  [187].  Matrix 
methods  are  quite  amenable  to  the  adding  of  side  conditions  and 
can  also  be  designed  to  take  account  of  rank-deficiency  of  the 
matrix  A (which  corresponds  to  the  case  of  singular  normal 
equations) . 

Polynomial  discrete  least-squares  fitting  has  been  widely 
used  for  fitting  surfaces  to  data,  both  scattered  and  regular. 
Several  authors  have  developed  algorithms  for  polynomial  dis- 
crete least-squares  fitting  of  scattered  data  by  constructing 
orthonormal  polynomials  (e.g.  by  Gram- Schmidt  orthonormaliza- 
tion) . See,  for  example,  Cadwell  and  Williams  [42],  Crain  and 
Bhattacharyya  [61],  and  Whitten  [201,202].  The  latter  contains 
a FORTRAN  program. 

When  the  data  are  more  regularly  distributed,  polynomial 
least-squares  fitting  can  often  be  simplified.  For  example,  if 
the  data  lie  on  a grid  as  in  (3.4) -(3. 6),  then  the  desired  or- 
thogonal polynomials  are  simply  products  of  the  one -dimensional 
orthogonal  polynomials  associated  with  the  one-dimensional  inner 
products  corresponding  to  [xJq  * and  [yjg  * respectively;  e.g., 
see  Cadwell  [41]  or  Clenshaw  and  Hayes  [56],  as  well  as  the  sur- 
vey papers  of  Hayes  [105,  108,  109]. 

There  are  also  special  methods  for  handling  data  which  are 
not  on  a grid  but  instead  lie  on  parallel  straight  lines.  For 
example,  Clenshaw  and  Hayes  [56]  have  developed  methods  using 
expansions  in  terms  of  Tchebycheff  polynomials  (although  the 
method  actually  only  produces  an  approximation  to  the  least- 
squares  fit  rather  than  the  actual  minimum) . 

Polynomial  least  squares  can  also  be  interpreted  as  multi- 
dimensional regression  as  practiced  by  statisticians,  e.g., 
see  Effroymson  [75],  For  example,  if  we  are  trying  to  fit  a 
function  in  the  form 
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dx  dy  , 

f (x,y)  = Z Z a x yJ, 
i=0  j=0  J 

then  by  defining  new  variables  by 

V M-  V = 0, 1,  . . .,dx 

ZV(dy+l)  +(j.  - X y » 4 = 0,1,  ...,dy 

we  can  write 

d 

f(x,  y)  = b.z,  , d = dxdy  + dx  + dy, 

i=0  1 1 

and  the  problem  becomes  one  of  fitting  a linear  function  in 
several  variables. 

We  close  this  section  by  observing  that  in  some  cases  it 
may  be  desirable  to  consider  weighted  least  squares.  In  parti- 
cular, if  we  have  positive  weights  w^  >0,  i = 1,2,  ...,N,  then 
we  may  replace  $ in  (5.1)  by 

N n 2 

I (a)  = Zw  Za  d (x  ,y  )-F 
W i=l  1 J=1  J J 1 1 1 

It  is  interesting  to  note  that  the  interpolation  formula 
of  Shepard  discussed  in  section  3.3  can  be  interpreted  in  terms 
of  weighted  least- squares  fitting.  In  particular,  fix  (x, y) 
in  D,  and  let  r^(x, y)  be  the  distance  from  (x, y)  to  the 
point  (x^yp  as  before.  Now  set  wi  = r and  consider 

least-squares  approximation  by  a constant  c,  using  these 
weights.  Then  one  easily  computes  that  the  least-squares  choice 
of  c is 


N 

N 

Z w F 

Z F r' 

i 1 1 

1 1 1 

N 

N 

Z w. 

Z r"^ 

, i 

This  approach  was  adopted  by  Pelto,  Elkins  and  Boyd  [152]  (as 
pointed  out  to  me  by  Chuck  Duris) . 
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5.2.  Discrete  Least- squares  fitting  by  splines.  As  outlined 
in  the  previous  subsection,  discrete  least  squares  can  be  car- 
ried out  with  any  finite  set  of  functions.  It  is  not  surpris- 
ing that  a number  of  authors  have  tried  using  tensor  product 
splines.  See,  e.g.,  Halliday,  Wall,  and  Joyner  [96],  Hayes  and 
Halliday  [110],  Jordan  [119],  Hanson,  Radbill,  and  Lawson  [97], 
and  Whiten  [199].  Hayes  and  Halliday  have  developed  both  ALGOL 
and  FORTRAN  programs.  It  is,  on  the  other  hand,  perhaps  some- 
what surprising  that  least-squares  fitting  with  splines  can  be 
somewhat  problematical.  We  briefly  discuss  the  method. 

Suppose  that  H = [a, b]  x [c,d]  is  a rectangle  containing 

the  domain  D of  interest.  Let  [x.]!^  ^ and  [y.)f  ^ be  parti- 

l 0 j 0 

k+1  2+ 1 

tions  of  [a,bj  and  [c,d],  respectively,  and  let  [N. ' 

i j i * m y L - n 

be  the  tensor  product  B-splines  discussed  in  section  3.5.  We 
consider  discrete  least-squares  fitting  using  these  (k+m) (i+n) 

B- splines . 

To  explain  how  difficulties  can  arise  with  spline  least- 
square  fitting,  we  observe  that  it  is  quite  easy  for  the  matrix 
A in  the  observational  equations  (5.8)  to  be  rank-deficient. 

On  a trivial  level  this  can  happen  if  for  some  B-spline  N , 
none  of  the  data  points  lies  in  its  support.  This  deficiency 
can,  of  course,  be  easily  removed  by  dropping  this  particular 
B-spline  from  the  set  being  used  to  approximate.  But  rank  de- 
ficiency can  also  occur  in  more  subtle  ways  because  of  the 
local  support  properties  of  the  functions.  This  problem  can  be 
overcome  with  properly  designed  algorithms.  See  Hayes  and  Halli- 
day [110]  for  a careful  discussion  of  spline  least-squares  fit- 
ting.Lawson  and  Hanson  [129]  include  a general  discussion  of 
how  to  handle  rank  deficient  least- squares  problems. 

If  we  operate  in  terms  of  the  normal  equations,  then  it 
may  well  occur  that  the  normal  equations  are  in  fact  singular. 
This  is  again  due  to  the  local  property  of  the  B-splines  corn- 
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bined  with  the  discrete  inner-product.  Even  when  it  is  not 
singular,  the  set  of  normal  equations  can  be  ill-conditioned 
(even  though  it  is  a relatively  sparse  matrix  with  a kind  of 
repeated  band- structure) . 

Discrete  least  squares  can  also  be  carried  out  with  vari- 
ous finite  dimensional  linear  spaces  of  blended  functions.  For 
an  extensive  study  of  such  methods,  see  the  dissertation  of 
Doty  [71]. 


5.3.  Discrete  Z and  Z approximation . 
discrete  least  squares,  we  may  consider 
approximation  problem:  Given  functions 
we  seek  a*  so  that 


Instead  of  performing 

the  following  discrete 

{0.}?  defined  on  D, 
J 1 


N n 

(5.9)  0(a)  = Z | Z a .0  (x  , y ) - F | 

i=l  j=l  J J 1 


is  minimized.  Alternatively,  we  may  minimize 


n 

(5.10)  0(a)  = max  |Za.0.(x.,y.)  - F.|. 

lSi^N  j=l  J J 1 i 1 

These  are  the  usual  Z and  Z^  best  approximation  problems. 
Both  of  these  problems  can  easily  be  reformulated  as  linear 
programming  problems  for  the  determinations  of  the  optimal  a* 
(cf.  Rabinowicz  [160,161]  or  Rosen  [167]).  Reasonable  choices 
for  the  (0j)  would  be  low-degree  polynomials  if  D is  small, 
or  possibly  spline  functions. 

Discrete  approximation  methods  of  this  type  have  had  rela- 
tively little  exposure  in  the  literature.  For  some  results 

using  tensor  product  splines  in  the  Z problem,  see  Rosen. 

00 

The  optimal  a*  was  obtained  there  by  using  the  standard  sim- 
plex method  on  the  associated  dual  linear  programming  problem. 

The  Z problem  can  also  be  solved  by  using  Remez-type 

00 

algorithms.  For  an  algorithm  which  performs  generalized 
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rational  approximation  (and  thus  can  also  be  used  for  polynomi- 
al approximations)  see  Kaufman  and  Taylor  [120],  Theoretical 
considerations  for  Tchebycheff  approximation  in  several  vari- 
ables can  be  found  in  Collatz  [58]  or  Weinstein  [196],  for  ex- 
ample. 

5.4.  Spline  smoothing  (scattered  data) . In  this  section  we 
consider  some  minimization  problems  similar  to  those  discussed 
in  section  3.4,  but  where  the  class  of  admissible  functions  is 
not  required  to  interpolate  and  where  the  functional  to  be  mini- 
mized includes  a term  measuring  how  close  the  function  comes  to 
fitting  the  data.  To  be  more  specific,  suppose  X is  a linear 
space  of  "smooth"  functions  and  that  8 is  a functional  on  X 
which  measures  the  smoothness  of  an  element  in  X.  Suppose  in 
addition  that  E is  a functional  defined  on  X which  measures 
how  well  a function  fits  the  data.  Then  the  spline-smoothing 
problem  is  the  following: 

(5.11)  Find  s e X such  that  p(s)  = inf  p(u), 

ueX 

where 

(5.12)  p(f)  = 0(f)  + E (f)  . 

The  abstract  theory  of  spline  smoothing  has  been  well 
developed*  see,  e.g.,  the  book  of  Laurent  [127]  and  references 
therein.  To  illustrate  the  ideas,  we  briefly  discuss  a couple 
of  examples.  We  suppose  as  in  section  3.4  that  X is  a semi- 
Hilbert  space  and  that  0 is  a seminorm  on  X with  N = 

[f  e X:  0(f)  = 0).  We  also  suppose  that  X is  actually  a 
function  space  defined  on  a domain  D,  and  that  the  point  eval- 
uators at  f(x^,y^)]^  are  bounded  linear  functionals  on  X. 

We  define 


(5.13)  E ( f)  = p £ [f (x  ,y  ) - F T, 
i-1  11  1 

where  p is  a fixed  positive  constant.  Then  it  can  be  shown 
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(cf.  Duchon  [72,73])  that  the  solution  of  Problem  (5.11)  is  a 
spline  which  can  be  written  in  the  form  (3.20),  where  now  the 
coefficients  are  determined  from  the  linear  system 


N 

£ K((x  ,y  ) ; (x  , 
i=l  J J 


u 

))a  + E b p (x  ,y  ) + a /p  = F , 
1 1 x=\  1 1 J J J J 


(5 . 14) 


£ a,pv(x.,y.)  = o, 
i=l  1 


j - 1,  2,  . . . , N, 
k = 1,2,  . . .,d. 


As  in  section  3.4,  the  application  of  this  method  depends 
on  constructing  a reproducing  kernel  K.  If  0 is  chosen  as 
in  (3.22),  Atteia  [10-12]  and  Thomann  [192,193]  considered 
spline  smoothing  for  spaces  of  smooth  functions  on  the  rectangle 
and  on  the  disc  (the  latter  even  contains  ALGOL  programs) . 

2 

Duchon  [72,73]  considers  similar  problems  defined  on  D = R . 

A similar  spline- smoothing  problem  has  also  been  consider- 
ed by  Pivorarova  [154],  where  0 is  taken  to  be 

(5.15)  0(f)  = //[D2f]2  + [D2f]2. 

X y 

See  also  Kubik  [123]. 

5.5.  Smoothing  splines  (gridded  data) . In  section  3.5  we  con- 
sidered several  minimization  problems  whose  solutions  led  to 
interpolating  polynomial  splines  (and  generalizations) . In  con- 
junction with  the  development  of  interpolating  splines  for 
gridded  data,  there  was  a concurrent  development  of  smoothing 
splines.  For  example,  instead  of  minimizing  the  integral  0 
in  (3.29)  over  appropriate  smooth  interpolating  functions,  we 
may  minimize  instead  p(f)  = 0(f)  + pE(f),  where  E is  given 
by 

(5.16)  E (f)  = E E [f(x  ,y  ) - F ]2. 

i=0  j=0  1 J 

For  results  in  this  direction,  see  e.g.  Nielson  [149,150].  For 
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9 given  by  (3.29),  the  smoothing  splines  are  again  polynomial 
splines.  Again,  more  general  linear  differential  operators  and 
more  general  linear  functionals  can  be  considered. 

5.6.  Continuous  least  squares.  The  method  of  continuous  least 
squares  is  not  directly  suited  to  fitting  surfaces  to  discrete 
data,  but  it  can  be  of  use  as  a second-stage  process,  so  we 
briefly  review  it.  We  suppose  now  that  F is  a function  de- 
fined on  D which  we  wish  to  approximate,  and  that  {0^}^  are 
given  functions  on  D.  We  define 

(5.17)  <f,g)  = //  f (x,  y)  g (x,  y)  dxdy,  ||f||2  = (f,  f) 

D 

and 

(5.18)  ®(a)  = ||  Z a 0 - f||2. 

j=l  J J 

The  problem  is  to  find  a*  to  minimize  <I>(a).  The  solution  is 
given  by  solving  the  normal  equations 

(5 . 19)  Aa  = r, 
where 

A = ((0i,0j»J#j=1  and  r = [ (0^),  . . .,  <0n,F)]T. 

For  reasonably  nice  approximating  functions  it  is  often 
possible  to  compute  the  normal  matrix  exactly.  In  practice, 
the  difficulty  lies  in  evaluating  the  right-hand  sides.  Gener- 
ally a quadrature  formula  is  required  for  this.  One  advantage 
of  the  method  would  be  that  if  several  data-fitting  problems 
are  to  be  solved  using  the  same  set  of  approximating  functions, 
one  can  do  the  work  of  inverting  the  normal  matrix  just  once 
and  re-use  the  result  as  many  times  as  desired. 

Reasonable  choices  for  the  approximating  functions  include 
polynomials,  or  better  yet,  tensor  product  B- splines  as  in 
(3.30).  Here  the  singularity  problems  do  not  crop  up  for  the 
splines  because  we  are  integrating  instead  of  summing  over 
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finitely  many  points.  The  normal  matrix  in  this  case  has  a kind 
of  repeated  band  structure.  The  entries  can  be  computed  exactly, 
e.g.,  by  Gaussian  quadrature  (cf.  de  Boor,  Lyche  and  Schumaker 
[38]) . Uniform  best  approximation  by  tensor  products  of  splines 
has  also  been  considered,  e.g.,  see  Sommer  [182], 

6.  Local  approximation  methods 

As  pointed  out  at  the  beginning  of  section  4,  there  are 
many  advantages  which  accrue  if  one  uses  local  methods  rather 
than  global  ones.  In  this  section  we  discuss  some  local  approxi- 
mation schemes. 

6.1.  Patch  methods.  As  in  the  case  of  interpolation,  the  sim- 
plest approach  to  obtaining  local  approximation  methods  is  to 
partition  the  domain  and  to  define  a surface  (patch)  on  each 
subdomain  separately.  In  particular,  suppose  that  D = U[D^,}p 
where  are  disjoint  subsets  of  D.  Then  we  may  seek  f in 

the  form 

(6.1)  f(x,y)  = ( f i (x,  y)  , (x,y)  e i = 1,2, ...,d. 

To  construct  the  patch  f^(x,  y),  we  might  use  the  data  available 
in  the  subregion  D^.  In  certain  cases,  however,  it  may  well  oc- 
cur that  no  data  at  all  are  available  in  the  set  . In  this 
case  we  may  choose  a somewhat  larger  set  D.  of  points  "near" 

'Vi 

D^,  and  use  the  data  in  to  construct  f^.  For  any  given 

method,  it  should  be  possible  to  make  the  choice  of  adaptive 

so  that  the  size  of  EL  is  kept  as  small  as  possible  consistent 

with  the  amount  of  data  desired  for  the  construction  of  f.. 

l 

The  patch  method  outlined  above  can  be  used  with  any  of  the 
approximation  methods  discussed  in  section  5.  For  example,  one 
might  choose  to  use  polynomials  (of  low  order),  and  to  do  dis- 
crete least- squares  approximation.  Or,  one  might  use  or 

^ approximation  or  some  other  convenient  space  (e.g.  splines) 
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instead  of  polynomials.  The  main  point  is  to  keep  the  size  of 
each  individual  patch  problem  (and  thus  the  size  of  the  corre- 
sponding system  of  equations)  small.  We  may  have  to  solve  a 
lot  of  systems  of  equations,  but  each  will  be  small  and  fairly 
we 11- conditioned. 

To  illustrate  how  the  adaptive  feature  might  be  implemented, 
suppose  that  the  domain  D of  interest  has  been  enclosed  in  a 
rectangle  H and  that  a partition  of  H is  defined  by  H = 

utHiJ)i=o;Ji=o  * with  Hij = [xi'xi+i]x  [yj'yj+i]  for  some 
(6.2)  a = x0  < xL  <...<  xk+1  = b,  c = y0<y1<---<y^1  = d- 

Now  suppose  that  we  want  to  do  discrete  least-squares  fitting 

using  a patch  of  the  form  f^^(x, y)  = a + bx  + cy  on  H . 

In  this  case  it  would  be  reasonable  to  require  that  at  least 

3 pieces  of  data  should  be  used  to  construct  f.  ..  If  H.  . 

U 1J 

does  not  contain  3 pieces  of  data,  we  expand  H to  H_  by 
adding  all  bordering  rectangles.  If  this  does  not  contain 
3 pieces,  we  again  add  all  bordering  rectangles,  etc.  We  then 
compute  the  discrete  least-squares  polynomial  using  the  data  in 
H.  but  then  we  use  the  resulting  function  only  in  H.  ..  The 
process  may  be  repeated  to  define  each  required  patch.  This 
kind  of  adaptive  algorithm  is  very  easy  to  program. 

In  using  patch  methods  to  get  local  interpolation  methods, 
we  concentrated  on  methods  using  data  at  comers  of  triangles 
or  rectangles,  and  by  choosing  appropriate  forms  for  the  patches, 
it  was  possible  to  get  the  individual  patches  to  match  together 
to  give  a continuous  global  surface  (or  with  more  sophisticated 
patches,  even  C^(D)  or  higher).  Here,  however,  where  the  in- 
dividual patches  are  determined  by  approximation,  it  is  not 
very  likely  that  the  patches  will  match  up,  and  the  global  sur- 
face will  generally  not  even  be  continuous.  For  most  applica- 
tions, this  is  a serious  drawback.  However,  as  we  shall  see  in 
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section  7,  patch  approximation  methods  can  still  be  very  useful 
as  first- stage  methods. 

6.2.  Direct  local  methods.  In  this  section  we  discuss  some 
local  methods  in  which  the  approximating  surface  is  constructed 
directly  from  the  data  without  solving  any  systems  of  equations. 
It  will  be  convenient  to  pose  a more  general  problem  than  pre- 
viously considered. 

Let  & be  a linear  space  of  functions  defined  on  D,  and 
suppose  that  {A^}^  are  linear  functionals  defined  on  Let 

N 

{(?i } i be  a prescribed  set  of  functions  defined  on  D.  Then  we 
are  interested  in  approximation  schemes  of  the  following  form: 

N 

(6.3)  QF(x,y)  = Z A F0  (x,  y)  . 

i=l  1 L 

We  can  think  of  this  as  a surface- fitting  problem  where  the 
data  are  given  by  F^  = A^F,  i = 1,2, . ,.,N.  Given  the  data, 
we  can  write  the  approximation  down  immediately. 

We  also  observe  that  if  the  0^  have  support  on  small  sub- 
sets of  D,  and  if  each  A^  also  has  support  on  the  same  set, 
then  the  formula  (6.3)  is  local.  For  example,  if  we  take  A^ 
to  be  point  evaluation  at  the  point  (x^,y^)  and  0^(x,  y)  to 
be  a function  with  support  in  a neighborhood  of  (x^,y^),  then 
the  approximation  formula  simply  becomes 
N 

(6.4)  QF  (x,  y)  = ZF0  (x,y). 

i=l 

This  is  very  reminiscent  of  the  Lagrange  form  of  interpolation 
(cf.  (4.3)),  but  unless  the  0^  are  taken  to  satisfy  (4.2), 

QF  will  not  in  fact  be  an  interpolant.  For  this  reason,  for- 
mulae of  the  form  (6.4)  (or  more  generally  (6.3))  are  sometimes 
referred  to  as  quasi-interpolants . Local  quasi-interpolants 
of  the  form  (6.3)  can  be  constructed  simply  by  defining  the 
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N 

functions  {0^}^  with  local  supports.  If  each  of  these  is 
continuous  (or  smooth),  then  QF  will  be  also. 

Although  a host  of  quasi-interpolants  can  be  constructed 
as  outlined  above,  considerable  care  must  be  exercised  in  order  - 

to  get  methods  which  give  good  accuracy  (when  the  original 
function  F is  smooth) . As  observed  earlier,  this  is  directly 
related  to  making  the  method  exact  for  polynomials,  i.e.,  such 
that  QP  = P for  all  P in  some  class  of  polynomials. 

To  construct  methods  of  the  form  (6.3)  which  apply  to 

N 

scattered  data,  it  is  necessary  to  construct  appropriate  [0^]^. 

While  a host  of  methods  can  be  constructed  this  way,  it  is  not 
so  easy  to  choose  the  0^  to  make  the  method  exact  for  poly- 

t 

nomials  (which,  as  we  remarked  earlier,  is  directly  related  to 
how  well  the  method  will  approximate  smooth  functions  F) . To 

get  methods  which  do  have  a reasonable  degree  of  exactness  (and  ; 

a correspondingly  good  error  bound  for  smooth  functions),  it  is 

easier  to  first  choose  the  [0. )^,  and  then  try  to  find  suit-  j 

^ 1 i- 

able  [A  . While  this  generally  rules  out  using  point  evalu-  \ 

ators  at  scattered  data,  it  is  possible  to  construct  methods  ' 

based  on  point  evaluators  at  appropriate  points,  and  such  meth-  j 

ods  can  be  useful  as  second-stage  approximations. 

To  illustrate  these  ideas,  we  consider  construction  of 
local  spline  approximation  methods  following  the  general  treat- 
ment in  Lyche  and  Schumaker  [131].  Suppose  D is  enclosed  in 

a rectangle  H,  and  that  H is  partitioned  into  subrectangles 

k Z 

by  a grid  as  in  (6.2).  Suppose  that  [N,  .}  , * , are  the 

r ij  1-m, 1-n 

tensor  product  B-splines  associated  with  this  partition  (cf. 

(3.30)).  We  are  now  interested  in  approximation  schemes  of  the 
form 

k 2 

(6.5)  QF (x,  y)  = Z Z A FN  (x, y) . 

i=l-m  j=l-n  1J  J 

In  particular,  we  are  going  to  consider  the  question  of 
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constructing  formulae  of  this  type  which  are  exact  for  the  class 

of  polynomials  u > with  some  fixed  1 < v < m and  1 < u < n. 

This  problem  has  a very  simple  algebraic  solution  if  we  decide 

to  construct  each  A. . in  the  form 

ij 

v u 

(6.6)  A = Z Z a . AX.  A*  , 
ij  v=  1 ll=1  LJV  iju 


where  the  [A*  }V  .. 

ijv  V=1 


and  [A^.  )U  . are  linear  functionals 


ijuVl 

which  apply  to  functions  of  x and  y alone,  respectively.  It 

can  be  shown  (cf.  [131])  that  given  any  [A. . ] and  {A. . } sa- 

ijV  ljU 

tisfying  mild  independence  assumptions,  there  exist  coefficients 

{a.  . } such  that  the  formula  (6.5)  will  be  exact  for 

l j Vp  vu 

In  fact,  these  coefficients  can  easily  be  explicitly  computed. 

To  give  one  example,  suppose 

r 


(6.7)  < 


(x.  , +. . . +x  .) 
. l + l i+m-r 

Ji  " (m-1) 

Ij  " (n-1) 


i = 1-m, . . .,k 
j = 1-m, 


Then  we  obtain 


k 1 

(6.8)  QF (x,  y)  = Z Z F(g  ,T|  )N  (x,y), 
i=l-m  j-l-n  1 J 

a formula  which  exactly  reproduces  the  bilinear  polynomials  P 

*■ 

This  is  the  multidimensional  (tensor  product)  version  of  the 
Variation  Diminishing  method  of  Marsden  and  Schoenberg;  it  was 
studied  in  some  detail  in  Munteanu  and  Schumaker  [145J.  This 
formula  is  closely  related  to  the  Bezier-type  surfaces  construc- 
ted in  Riesenfeld  [163]  (see  also  Gordon  and  Riesenfeld  [89]). 

We  should  observe  that  the  way  formula  (6.5)  now  stands,  it 
may  involve  information  on  F which  is  taken  from  data  outside 
of  the  domain  D.  This  situation  can  be  rectified  as  follows: 
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Let 

(6.9)  Q = {(i,j):  support  A^  . D D not  empty). 

Then  It  can  be  shown  [131]  that  the  method 

(6.10)  QF (x,  y)  = ZZ  A FN  (x,  y) 

(i,j)eu 


remains  exact  as  long  as  all  functions  are  restricted  to  D. 

To  get  higher-order  methods,  depending  only  on  point  eval- 
uations, we  proceed  as  follows.  Choose 


x,  < 

x 

T . . < 

> 

CSI 

II 

(6.11) 

i 

ijv 

i-Hn* 

/ / / 

yj< 

T^  . < 

yj+n' 

la  = 1,2,  ...,u 

for  i 

= 1-m, 

...,k 

and  j 

= 1-n, 

X -\  V 

to  be  point  evaluation  at  t. . and  A^ . to  be  point  evalu- 

ijv  i Jn 

ation  at  xy  , the  coefficients  in  (6.6)  are  easily  computed, 
ijir 

Hints  on  where  the  t's  should  be  placed  within  the  support 
of  the  B-splines  are  given  by  the  error  analysis  in  [131). 

We  close  this  section  with  some  historical  remarks  on  the 


development  of  local  approximation  schemes  in  two  dimensions. 
Early  papers  include  Babuska  [14],  de  Boor  and  Fix  [37 J,  and 
Fix  and  Strang  [ 80 J . For  some  methods  involving  triangular 
partitions,  see  Fredrickson  [82].  Quasi-interpolants  were 
constructed  in  de  Boor  and  Fix  [37]  using  point  evaluation 
data,  but  including  derivatives.  We  have  followed  Lyche  and 
Schumaker  [131]  where  general  linear  functionals  are  consider- 
ed, and  where  in  particular,  methods  can  be  constructed  using 
only  point  evaluation  of  the  function.  (Local  integrals  etc. 
would  also  be  possible.)  The  papers  [37]  and  [131]  both  con- 
tain extensive  error  bound  analyses.  It  is  striking  that  these 
local  spline  approximation  methods  give  optimal  order  error 
bounds  for  smooth  functions. 
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7 . Two-stage  processes 

Many  of  the  methods  we  have  discussed  in  this  paper  are 
only  applicable  when  the  data  are  regularly  spaced  (and  in  fact, 
many  surface- fitting  methods  require  specification  of  derivative 
data  as  well  as  function  values) . Such  methods  cannot  be  ap- 
plied directly  to  the  scattered  data-fitting  Problem  1.1.  On 
the  other  hand,  some  of  the  most  convenient  local  interpolating 
and  local  approximating  methods  which  do  work  for  scattered 
data  produce  surfaces  which  are  not  globally  smooth  (or  even 
continuous).  Thus,  it  seems  natural  to  consider  the  possibility 
of  constructing  two-stage  processes  in  which  the  first  stage 
uses  the  scattered  data  to  construct  an  approximation  g,  while 
the  second  stage  uses  g to  generate  data  for  constructing  a 
surface  f (with  desirable  properties,  such  as  smoothness) . 

Since  it  is  quite  clear  how  various  methods  discussed  in 
the  earlier  sections  might  be  put  together  to  yield  two-stage 
processes,  it  will  suffice  to  mention  just  a couple  of  examples 
here. 

7.1.  Interpolation/ interpolation.  Suppose  that  we  want  to  con- 
struct a piecewise  bicubic  surface  based  on  data  given  on  a 
rectangular  grid  as  in  (3. 4) -(3. 6).  In  each  subrectangle  . 
the  16  coefficients  of  the  bicubic  f (cf.  (4.6))  would  be  de- 
termined by  the  values  of  f,  f , f , and  f at  each  of  the 

7 x y xy 

four  corners.  Now  since  our  original  data-fitting  problem  only 

specifies  the  values  of  the  function  at  the  grid  points,  local 

interpolation  cannot  be  carried  out  directly.  However,  we  can 

use  the  data  to  provide  estimates  for  the  values  of  f^,  f^, 

and  f at  the  grid  points  (i.e.,  we  construct  g interpolating 
xy 

the  data);  then  we  can  use  local  bicubic  interpolation  as  a 
second  stage.  The  reader  will  have  no  difficulty  in  imagining 
ways  to  produce  estimates  for  these  quantities.  For  some  meth- 
ods which  appear  in  the  literature,  see  the  papers  of  Akima 
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[5,6],  Koelllng  and  Whitten  [121],  and  Spath  [183]. 

7.2.  Approximation/ interpolation.  Instead  of  making  the  first- 
stage  process  interpolation  as  in  section  7.1,  it  would  also  be 
possible  to  use  an  approximating  process.  For  example,  one 
might  use  least-squares  polynomial  approximation  to  construct 

a patch  surface  and  then  use  some  convenient  interpolation  pro- 
cess as  a second  stage.  For  an  example  of  this  type,  see  Mess- 
ing et  al  [114 ] . 

7.3.  Approxima tion/approximation.  This  combination  is  parti- 
cularly convenient  if  we  are  not  concerned  about  getting  an  in- 
terpolating function.  Both  stages  can  be  made  local.  To  give 
an  example,  recently  I have  constructed  an  algorithm  for  fitting 
surfaces  to  scattered  data  in  which  the  first  stage  consists 

of  polynomial  least- squares  patch  approximation  (with  adaptive 
choice  of  data--see  section  6),  and  where  the  second  stage  con- 
sists of  direct  local  tensor  product  spline  approximation.  Both 
stages  are  local,  and  the  final  surface  is  a tensor  product 
spline.  Since  the  second  stage  is  a direct  method,  it  is  very 
cheap  to  apply.  Experiments  with  real-life  data  (e.g.  from 
heart  potentials,  potential  fields,  and  geological  maps--see 
section  2)  have  produced  very  promising  results.  Details,  in- 
cluding an  analysis  of  error  bounds,  will  appear  elsewhere.  I 
have  also  tried  alternate  versions  where  the  patches  are  con- 
structed as  low-order  polynomials  which  are  best  approximations 
in  the  or  H sense  (via  linear  programming)  again  with  adap- 
tive  choice  of  data.  The  results  were  very  similar.  Finally, 

I have  also  experimented  with  computing  patch  approximations, 
followed  by  continuous  least-squares  tensor-product  spline  ap- 
proximation. Again,  the  experiments  were  promising. 

8.  Contouring 

As  indicated  in  the  introduction,  frequently  the  goal  in 
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fitting  a surface  f to  data  is  to  construct  a contour  map 
which  approximates  the  contour  map  of  the  unknown  surface  F 
which  produced  the  data.  In  this  section  we  discuss  some  methods 
for  constructing  contour  maps  of  a surface  f. 

8.1.  Piecewise  linear  functions  on  triangles.  When  the  func- 
tion f to  be  contoured  is  a piecewise  linear  function  defined 
on  triangles  (and  globally  continuous),  locating  contours  re- 
duces essentially  to  a matter  of  good  bookkeeping.  Indeed,  if 
H is  the  height  of  the  contour  of  interest,  then  it  is  easily 
seen  that  for  a given  triangle  T with  vertices,  A,  B,  and  C, 

(8.1)  the  contour  does  not  pass  through  T if  H < min(f(A), 
f(B),  f(C))  or  if  H > max  (f  (A) , f (B) , f (C) ) 

and 


(8.2)  the  contour  intersects  exactly  two  sides  of  T otherwise. 

If  case  (8.2)  holds,  it  is  easy  to  determine  which  two  sides 
are  intersected  and,  moreover,  by  using  inverse  linear  interpo- 
lation between  vertex  values,  the  points  on  these  sides  where 
the  contour  crosses  can  be  determined.  Specifically,  if,  for 
example, 

f (A)  < H < f (B)  , 

then  the  contour  crosses  the  line  from  A to  B at  the  point 
on  the  line  which  is  a distance  of 


-MSSto)- 

(f(B)-f(A)) 


B - A 


from  A.  Given  the  points  on  two  sides  of  a triangle  where  the 
contour  line  crosses,  we  can  now  draw  the  contour  line  since  it 
is  simply  a straight  line  between  the  points.  An  algorithm  to 
carry  out  this  procedure  requires  enumerating  the  triangles  and 
vertices  and  some  kind  of  effective  search  procedure.  There 
are  several  available  in  the  literature.  For  ALGOL  programs. 
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see  Heap  [111,112].  (An  earlier  paper  of  Heap  and  Pink  [113] 
contains  a similar  FORTRAN  program  but  only  for  regular  triangu- 
larizations.)  Lawson  [128]  discusses  a similar  algorithm.  The 
algorithms  mentioned  include  two  possible  approaches:  (1)  one 

may  start  with  a triangle  where  it  is  known  the  contour  inter- 
sects, and  trace  this  contour  as  far  as  it  goes,  or  (2)  one  may 
simply  draw  the  contour  lines  in  all  triangles  which  have  them. 

8.2.  Piecewise  bilinear  functions  on  rectangles.  Suppose  now 
that  the  function  f to  be  contoured  is  a piecewise  (continuous) 
function  on  a rectangle  partitioned  into  subrectangles  by  a grid. 
Since  f is  linear  in  x or  y on  the  edges,  it  follows  that 
we  can  again  determine  whether  a contour  line  of  height  H 
crosses  an  edge  by  inverse  linear  interpolation.  There  is  in 
this  case,  however,  a serious  difficulty  which  does  not  arise 
in  the  case  of  triangles.  It  may  happen  that  the  height  H 
lies  on  three  or  even  four  sides  of  the  rectangle.  In  this 
case,  it  is  possible  that  two  different  contour  lines  pass 
through  the  rectangle,  and  it  is  not  clear  how  to  interconnect 
the  points  (see  Figure  6)  . 


Figure  6.  Two  Contours  in  a Rectangle 
Put  another  way,  if  we  are  following  a contour  and  enter  a rec- 
tangle as  shown  above  in  Figure  6 on  the  bottom  line,  then  it 
is  not  clear  whether  we  should  now  turn  right  or  turn  left.  One 
approach  to  designing  an  algorithm  in  this  case  is  to  simply 
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always  go  right,  say,  even  though  this  may  in  the  end  be  wrong. 
(If  it  is,  we  have  to  start  over  with  a coarser  mesh.)  This 
technique  was  incorporated  in  an  algorithm  by  Heap  [111, 112]  — 
the  paper  contains  a FORTRAN  program.  (An  earlier  ALGOL  pro- 
gram can  be  found  in  Heap  and  Pink  [113], 

A second  approach  to  overcoming  the  ambiguity  is  to  com- 
pute an  approximation  to  the  value  of  f at  the  center  of  the 
rectangle  (e.g.,  by  taking  the  average  of  the  four-corner  val- 
ues) and  then  to  triangulate  the  rectangle.  This  amounts  to  a 
second-stage  approximation  process,  and  the  surface  contoured 
is  no  longer  f itself  but  an  approximation  g.  This  idea  was 
programmed  in  ALGOL  in  Heap  and  Pink  [113]  and  in  FORTRAN  in 
Heap  [111,112], 

Once  the  set  of  points  for  a particular  contour  have  been 
found,  there  are  a variety  of  ways  of  drawing  a contour  line 
through  these  points.  One  possibility  is  to  simply  draw 
straight  lines  between  each  of  the  points.  The  actual  contour 
lines  are  expressions  of  the  form  y = (a+bx) / (c+dx)  in  each 
rectangle.  These  are  generally  not  straight  lines.  Hence,  if 
smoother  contours  are  desired,  one  may  use  any  one  of  a number 
of  methods  for  drawing  a smooth  curve  through  an  ordered  set  of 
points  in  the  plane.  For  example,  the  curve  could  be  computed 
in  parametric  form  using  one- dimensional  splines.  Another  pos- 
sibility would  be  to  use  the  Bezier  methods  with  either  Bern- 
stein polynomials  or  with  B-splines  (cf.  Gordon  and  Riesenfeld 
[89]  and  Riesenfeld  [163]),  although  in  this  case  the  curves 
will  not  exactly  go  through  the  points.  For  other  algorithms 
see  Marlow  and  Powell  [138]  or  McConalogue  [139]. 

8.3.  Piecewise  quadratics  on  triangles.  Suppose  now  that  f 
is  a piecewise  quadratic  defined  on  a triangular  partition.  In 
this  case  a contour  line  at  height  H passing  through  a trian- 
gle must  be  described  by  a conic  section.  Such  a section  can 
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be  represented  in  parametric  form  as 

x(t)  = (b0+b1t+b2t2)/(b^b4t  + b5‘2) 

y(t)  = (b& + byt +bgt2)/(b3  + b4t  + b5t2), 

see  Powell  [156],  Powell  has  promised  an  algorithm  based  on 
this  observation. 

We  turn  now  to  some  methods  for  handling  general  functions 
f on  arbitrary  domains  D. 


8.4.  A simple  line-printer  method.  The  following  simple-minded 
method  can  produce  reasonable- looking  contours  without  excessive 
computation,  and  without  recourse  to  a plotter.  Suppose  H is 
a rectangle  enclosing  the  domain  D,  and  that  we  partition  H 
as  H = U th  . with  a rectangular  grid  as  in  (6.2) . Let  HL  < HU 
be  given  real  numbers.  Finally,  suppose  that  t is  some 
point  in  H where  f can  be  evaluated  (perhaps  one  of  the 
corners  or  the  center) . Define 


(8.3) 


0 

9 


if 


f(t..)  < HL 


v , if  HL 


, if  f(t  ) > HU 
(v-l)h  < f(tij)  < HL  + vh,  1 < v < 8, 


for  all  i = 0, 1,  . . ,,k  and  j = 0,  1,  . . .,  1 (where  h = (HU-HL)  /8)  . 
The  (k+2)  by  (i+2)  matrix  C contains  only  integers,  and  if  it 
is  printed  out  without  either  horizontal  or  vertical  spacing, 
we  obtain  a reasonable- looking  contour  map  of  the  function.  A 
typical  example  is  included  in  Figure  7.  The  method  can  be 
refined  by  using  an  alpha-numeric  array  C and  more  than  10 
symbols.  It  can  also  be  refined  by  using  a printer  with  appro- 
priate horizontal  spacing  so  that  each  symbol  occupies  a square 
rather  than  a rectangle  (e.g.,  cf.  Buneman  [40]). 

8.5.  Threading  on  a rectangular  grid.  As  in  section  8.4, 
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Figure  7.  A Simple  Contour  Map  (Heart  Potential) 


suppose  that  D is  imbedded  in  a rectangle  H which  has  been 
partitioned  by  a rectangular  grid  as  in  (6.2).  Assuming  that 
f is  continuous,  it  is  still  possible  to  decide  which  of  the 
grid  lines  a particular  contour  of  height  H crosses  by  examin- 
ing the  end-points  of  each  such  line.  Since  f is  not  generally 
linear  along  such  a line,  we  cannot  determine  exactly  where  the 
crossing  point  is  by  linear  inverse  interpolation.  However,  if 
we  are  willing  to  evaluate  f a few  times  along  this  line,  we 
can  estimate  the  crossing  point  quite  accurately  by  bisection, 
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for  example.  Once  a sequence  of  points  on  a contour  has  been 
determined,  we  may  thread  a curve  through  the  points  just  as  in 
section  8.2. 

This  method  does  have  one  serious  drawback,  however, -- 
just  as  with  the  method  discussed  in  section  8.2--,  if  we  are 
tracing  a contour  it  may  happen  that  after  entering  a triangle 
there  is  an  ambiguity  as  to  which  of  two  points  to  use  to  exit 
the  rectangle.  One  could  opt  for  an  ad  hoc  rule  or  try  the 
second-stage  approximation  described  in  section  8.2.  For  an 
example  of  how  this  method  works,  see  Falconer  [76]  (based  on 
Lodwick  and  Whittle  [130]),  where  it  is  applied  to  a surface 
constructed  by  local  weighted  quadratic  polynomial  least- squares 
approximation.  Since  bisection  is  involved,  one  should  realize 
that  in  drawing  contours  with  this  routing  the  surface  f is 
going  to  be  evaluated  a great  many  times. 

8.6.  Threading  on  a triangular  grid.  An  obvious  cure  for  the 
ambiguity  discussed  in  section  8.5  for  threading  on  a rectangu- 
lar grid  is  to  use  a triangular  partition  in  the  first  place. 
Then  the  bisection  method  coupled  with  a threading  routine  leads 
immediately  to  a contouring  routine  for  general  surfaces  f. 
Strangely  enough,  I have  not  been  able  to  find  anywhere  where 
this  method  has  been  suggested. 

I have  made  no  effort  to  track  down  all  available  papers 
on  contouring.  A few  which  I did  find  and  have  not  yet  men- 
tioned are  Cottafawa  and  le  Moli  [60],  Dayhoff  [64],  and  Pelto 
et  al  [152].  There  are  many  others. 

In  some  cases  it  may  be  desirable  to  have  a more  graphic 
picture  of  a surface  than  a contour  map  can  provide.  Recently 
there  has  been  considerable  effort  devoted  to  computer  methods 
for  displaying  surfaces  on  a scope  or  with  a plotter.  For  some 
examples  of  output  and  a discussion  of  methods,  see  e.g.  the 
book  by  Barnhill  and  Riesenfeld  [20]  on  computer-aided  design. 
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If  an  actual  3-D  picture  is  desired  instead  of  just  a perspec- 
tive, it  is  even  possible  to  produce  holographs. 
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SIGNALS  WITH  A FINITE  SPECTRUM 
A.  G.  Vitulskin 

Functions  of  real  variables  whose  Fourier  transforms 
have  finite  support  are  often  called  "signals  with  a finite 
spectrum."  Sound  signals,  radio  signals,  areal  photographs, 
holograms  and  so  on  are  examples  of  signals  of  this  kind. 
Coding  of  a signal  usually  consists  in  its  discrete  repre- 
sentation. In  the  simplest  case,  this  would  be  a certain 
sequence  of  binary  numbers  from  which  the  original  function 
can  be  recovered. 

The  discrete  form  of  representation  rather  than  the 
continuous  (or  the  so-called  analogue)  representation  provides 
more  ways  of  protecting  the  signal  against  various  kinds  of 
possible  distortion  in  its  retaining  and  transmission.  The 
problem  of  coding  is  then  reduced  to  the  construction  of 
appropriate  approximations.  The  first  requirement  which 
these  approximations  must  satisfy  is  that  the  number  of  binary 
parameters  defining  a function  should  be  as  small  as  possible. 

It  is  known  that  the  estimation  of  the  length  of  the 
most  economical  code  can  be  reduced  to  the  calculation  of 
the  entropy  of  the  corresponding  function  class.  We  will 
present  here  the  main  results  concerning  the  calculation  of 
the  entropy  of  classes  of  entire  functions  and  discuss  one 
of  them  in  detail. 

We  will  also  define  a special  notion  of  closeness  between 
signals  and  present  an  estimate  of  the  density  of  codes  for 
the  corresponding  class  of  functions. 
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1 Representation  of  an  entire  function 
and  Kotel 'nikov's  theorem 

The  concept  of  signals  with  finite  spectrum  is  usually 

associated  with  the  Bernstein  class  B of  entire  functions. 

a 

This  class  consists,  by  definition,  of  the  real-valued  func- 
tions defined  and  bounded  in  absolute  value  by  unity  on  the 
real  line  such  that  the  supports  of  their  Fourier  transforms 
are  contained  in  the  closed  interval  [-o,a].  Hence,  any 
function  f in  B^  is  an  entire  function  satisfying  the 
inequality 

| £ (x) | < ea 1 Im= I 


for  every  complex  z (cf.  [6]).  It  is  known,  by  Kotel 'nikov' s 
theorem  [5],  that  the  information  content  of  a signal  with 
spectrum  a is  proportional  to  a.  V.  A.  Kotel'nikov  has  also 
shown  in  [6]  that  any  square  integrable  function  f in  B^ 
can  be  represented  in  the  form 


f(t)  = 


sin  at 


Z 

k=-°° 


(-l)kf  (— ) 
a 


t - kr/a 


It  follows  from  this  representation  that  the  number  of  param- 
eters (per  unit  time)  defining  the  function  is  proportional  to 
a 


Perhaps,  the  following  representation  for  functions  of 

the  class  B^  is  more  convenient  in  applications.  For  any 

natural  number  p and  f e B , we  have 

a 


f(t)  = (j)(t)sin  at  + 


z (-Dkf(^) 

a . a 

k=~o° 


[ I _ _ _ (C^P+1  p| 

|t  - kir/a  nk  irk  ’’’  mx 
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where  4>(t)  is  some  polynomial  with  degree  not  exceeding  p 
(cf.  [6]). 

2 The  Kolmogorov-Tihomirov  theorem 

There  exist  at  present  more  concrete  forms  of  the  assertion 
of  Kotel'nikov  concerning  the  information  content  of  signals 
with  a finite  spectrum.  We  consider  here  two  of  these  results. 

DEFINITION  2.1  A set  B is  called  an  e-net  of  the  class  B 
a 

on  the  interval  [~T,T]  if  for  any  f e B there  exists  a 

* 0 

function  f*  e B^  such  that  for  any  t e [-T,T] 

|f(t)  - f * (t)  | <_  e. 

We  denote  by  N (B  , T)  the  number  of  elements  of  the  minimal 

e a 

e-net  of  the  set  B on  the  interval  [-T.T1  and  call 

a 


WT)  ' 1o«2  WT) 

the  e-entropy  of  the  set  B^  on  the  interval  [-T,T]. 

The  following  theorem  due  to  A.  N.  Kolmogorov  and  V.  M. 
Tihomirov  can  be  found  in  [4], 

THEOREM  2.1 


lim 
e •>  0 


lim 
T -+ 


H (B  ,T) 

e a 

00  2T  log  1/e 


= lim  lim 
e-K)  T-*» 


WT> 

2T  log  Hz 


a 
ir  ‘ 


Another  model  of  signals  with  a finite  spectrum  is  in 
the  form  of  a random  process  with  a finite  spectrum.  C.  Shan- 
non defined  the  notion  of  entropy  for  a random  process  and 
presented  (without  proof)  the  asymptotic  formulas  for  the 
entropy  for  processes  with  finite  spectrum  [7].  It  is  known 
that  almost  all  realizations  of  a process  of  this  kind  are 
entire  functions  of  exponential  type.  That  is  why  Kolmogorov- 
Tihomirov'  s asymptotic  formula  for  the  entropy  of  Ba  has 
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turned  out  to  be  similar  to  Shannon's.  However,  Shannon's 
equality  for  random  processes  remained  for  a long  period  of 
time  without  proof,  and  it  was  L.  B.  Sofman  who  re- 
fined the  statement  and  proved  it  (cf.  [8]). 

3 The  notion  of  closeness  of  signals 

We  now  turn  to  the  main  subject  of  this  paper,  namely 
the  estimate  of  the  density  of  codes  for  signals  with  a 
finite  spectrum.  First  we  should  refine  the  notion  of  close- 
ness of  signals. 

The  systems  of  measurements  used  in  engineering  problems 
are  sometimes  so  complicated  that  there  is  no  way  to  describe 
them  by  using  only  the  conventional  metrics.  Let  us  consider, 
for  example,  the  system  of  measurements  used  in  sound  recording 
problems  to  describe  the  quality  of  sound  reproduction.  Among 
the  technical  parameters  that  are  used  to  characterize  the 
quality  of  the  recording  equipment,  the  following  are  essential 
from  the  point  of  view  of  information  theory: 

1)  a denotes  the  maximal  reproducible  frequency, 

2)  e is  the  relative  error  of  the  reproduction,  and 

3)  D = 20  log^o(M/6)  is  the  dynamic  range  of  the  equipment. 
Here,  M is  the  norm  of  the  maximal  possible  output  signal 
and  6 is  the  norm  of  the  noise  at  the  output.  The  norm  of 
a signal  f(t)  is  defined  by 

f ^ (x) dx, 

where  r > 0 is  a constant  commensurate  with  1/a. 

Taking  these  parameters  as  a model,  we  will  give  several 
definitions,  the  first  of  which  is  the  definition  of  closeness. 

DEFINITION  3.1.  Let  t,  6 and  r be  fixed  positive  constants 
and  f(t),  f*(t)  be  functions  defined  on  the  whole  real  line. 


f(t) 


= max 
t 


]/r 

t-r 
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We  say  that  the  function  f*  is  close  to  the  function  f 
if  for  any  real  t we  have 

| f ( t ) - f*(t)|  <_  e max  |f(x)|  + 6. 

t-r  x t+r 

We  shall  assume  that  for  functions  in  B the  constant 

o 

r is  commensurable  to  1/a.  The  meaning  of  this  restriction 
will  be  clear  when  we  discuss  the  results  below. 

4 Complexity  of  an  apparatus 

Next,  we  define  the  notion  of  an  apparatus  and  the  param- 
eters that  are  used  to  characterize  its  quality  and  complexity. 
An  apparatus  P is  a pair  of  transformations  and  P^ 

possessing  the  following  properties:  Every  real-valued  func- 

tion f(t)  defined  on  the  whole  real  axis,  called  an  input 
function,  is  transformed  by  the  operator  P^  into  a function 
4>  = <{>(kT,f)  defined  for  all  integers,  k,  where  t is  a 
positive  constant  independent  of  the  input  functions;  the 
function  $ is  allowed  to  take  on  only  the  values  0 and  1. 
In  other  words,  the  operator  P^  associates  with  every  input 
function  f(t)  a sequence  of  binary  numbers  <J>(kx,f), 

(k  = -oo,  ...  ,0,  ...,<») , uniformly  distributed  in  time  with  the 
density  1/t  per  unit  time.  This  sequence  is  called  a 
binary  code  of  the  input  function  f(t).  The  second  operator 
transforms  the  sequence  4> (kx , f ) into  a real-valued 
function  f*  = P(f)  defined  on  the  whole  real  axis  and 
bounded  in  modulus  by  unity. 

In  addition,  it  is  further  assumed  that  there  exists 
a positive  constant  l such  that  for  any  input  function  f(t) 
and  every  integer  k the  value  4>(kx,f)  is  uniquely  de- 
termined by  the  values  of  the  function  f(t)  in  the  interval 
[kT  - £,  kx  + ?,],  and  for  every  t the  value  f*(t)  depends 
only  on  the  values  P^(kx)  for  t-£.<_kx_<t  + t.. 
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The  constant  Z is  called  the  delay  and  the  number 
h = 1/t  the  code  density  of  the  apparatus.  If  the  bounded- 
ness condition  of  the  delay  of  the  apparatus  were  omitted 
from  the  above  definition,  the  notion  of  code  density  would 
not  be  well-defined.  Indeed,  by  stretching  the  code  sequence 
we  can  assign  any  value  to  the  code  density. 

The  parameters  h and  t characterize,  in  a sense,  the 
complexity  of  the  apparatus. 

5 Quality  of  an  apparatus 

To  describe  the  quality  of  reproduction  we  will  use  three 
parameters  a,  e and  6. 

We  say  that  the  parameters  of  an  apparatus  P are  not 

worse  than  a,  c,  6 if  for  every  function  f z B the  corre- 

a 

sponding  function  f*  = P(f)  is  close  to  f(t).  In  other 
words,  an  apparatus  has  parameters  a,  e and  <5  if  it 
records  and  reproduces  signals  so  precisely  that  for  any 
signal  with  spectrum  a the  output  signal  is  close  to  the 
input  signal. 

In  engineering  problems,  for  an  equipment  with  parameters 
a , e,  6,  the  number  D = 20  log^d/6)  is  called  the 
dynamic  range  of  the  equipment.  It  is  said  that  the  equipment 
has  a wide  dynamic  range,  if  both  large  and  small  signals 
can  be  reproduced  with  the  same  accuracy. 

Having  given  all  the  necessary  definitions,  we  can  now 
formulate  the  result. 

6 Estimate  of  the  code  density 

For  any  positive  numbers  a,  e and  6,  it  is  possible 
to  construct  an  apparatus  with  parameters  not  worse  than  o, 
e,  <5,  and  with  complexity  characterized  by  the  inequalities 

h < — log  — and  Z < max 

— we  — 
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where  C is  an  absolute  constant.  It  should  be  pointed  out 
that  the  right  hand  side  of  the  first  inequality  above  does 
not  contain  the  parameter  6.  This  means  that  is  is  possible 
to  construct  an  apparatus  with  an  arbitrarily  wide  dynamic 
range  D by  using  codes  with  density  independent  of  D. 

This  seems  rather  unexpected,  because  in  engineering 
problems  another  point  of  view  prevails:  Namely,  in  con- 

structing an  apparatus  with  the  analogue  system  of  recording 
a sufficiantly  wide  dynamic  range  is  most  difficult  to  obtain. 
Hence,  we  should  not  expect  that  a wide  dynamic  can  be  ob- 
tained without  any  difficulties  at  all.  In  the  digital  system, 
for  example,  obtaining  a wide  dynamic  range  requires  either 
long  codes  or  complex  schemes  of  coding.  In  addition,  it 
should  also  be  noted  that  it  is  impossible  to  construct  an 
apparatus  with  infinite  dynamic  range  using  codes  of  finite 
density . 

7 Entropy  of  the  class 

The  estimate  of  code  density  consists,  as  usual,  of 
counting  the  entropy  of  the  corresponding  function  class. 

Let  the  numbers  a,  e,  6 and  r introduced  above  be 
fixed,  and  let  B*  be  a set  of  functions  defined  on  a closed 
interval  [ — T , T ] . This  set  is  called  a net  of  the  class  B 

a 

on  the  segment  [-T,T],  if  for  any  function  f e B^  there 
exists  a function  f*  e B*  close  to  f on  [-T,T];  meaning 
that  for  any  t e [-T,T]  the  following  inequality  holds: 

| f ( t ) - f*(t)|  <_  max  | f ( t ) | + 6. 

t- r <x<t+r 

Denote  by  N(T)  the  number  of  elements  of  the  minimal  net 
of  the  set  B^  on  [-T,T] . The  number  H(T)  = log  N(T) 
is  called  the  ( e, 6)-entropy  of  the  set  B^  on  [-T,T]. 

The  following  theorem  can  be  found  in  [3]. 


rid 
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THEOREM  7.1  Let  a,  e £ 1,  6 <_  1 and  r >_  1/a  be  positive 
numbers.  Then 


H(T) 


2aT  C 

7r  max{£,6} 


for  all  sufficiently  large  T,  where  C is  a positive  func- 
tion of  a,  e,  6,  r and  lies  between  two  absolute  positive 
constants  and  C^. 

Denote  by  H = H(a,e,6)  the  minimum  of  the  code  density 
h = h(P)  taken  over  all  apparatuses  with  parameters  a,  c 
and  6.  It  can  easily  be  shown  that 


H = lim  H(T) . 

T-w  z 

Indeed,  for  any  T,  on  the  one  hand,  any  apparatus  with 
parameters  a,  e and  6 generates  a net  of  the  class  B 

a 

on  the  segment  [-T,T]  (this  net  is  the  set  of  all  output 

signals  when  the  input  signals  are  all  functions  from  the 

class  B^),  an<^  on  other  hand,  any  net  can  be  looked  upon 

as  an  apparatus  which  puts  in  correspondence  to  every  function 

from  B one  of  the  nearest  elements  of  the  net. 
a 

Hence,  the  theorem  just  formulated  implies  that 

„ a , C 

H = — log  7 7— i 

it  maxi  e,  6 } 


That  is,  the  code  density  of  the  most  economical  apparatus 

with  parameters  a,  e,  6 is  equal  to 

a , C 

— l02  . 

tt  max{  e,  6}  * 


8 Discussion  of  the  result 

If  we  put  6=0  and  take  e sufficiently  small,  then 
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k 

the  constants  1/2  (k  running  over  all  positive  integers) 
are  pairwise  distant,  i.e.,  none  of  these  constants  is  close 
to  another.  Hence,  the  entropy  H(T)  is  infinite  and  con- 
sequently, H = This  means  that  there  is  no  apparatus 
with  an  infinite  dynamic  range. 

Recall  that  the  definition  of  closeness  of  signals  contains 
the  parameter  r.  We  have  been  assuming  all  along  that 
r >_  1/a.  If  we  put  r = 0,  then  the  corresponding  value  H 
turns  out  to  be  equal  to 

o C 

tt  108  min{e,5}  ’ 

where  the  constant  C is  again  understood  to  be  a positive 
function  of  all  parameters  separated  from  zero  and  infinity. 
Hence,  in  the  estimate  of  H the  symbol  min{e,6}  replaces 
max{e,6}.  That  is,  in  the  case  when  r = 0 and  6 < e, 
the  code  density  H of  the  most  economical  apparatus  turns 
out  to  be  equal  to  (a/ir) log(C/<$) . We  remark  that  in  this 
case,  H happens  to  be  essentially  dependent  on  the  parameter 
6 . 

This  circumstance  shows  that  the  conclusion,  that  there 
exists  an  apparatus  with  a wide  dynamic  range  and  relatively 
small  code  density,  is  correct  as  long  as  the  choice  of  metric 
is  reasonable. 

The  notion  of  closeness  of  signals  has  been  defined  to 
correspond  to  the  system  of  measurements  which  at  present  is 
used  in  radio engineering.  The  condition  r >_  1/a  seems 

to  be  a natural  one  as  well,  because  errors  of  reproduction 
are  usually  related  to  the  energy  of  the  signal  for  each 
certain  period  of  time  and  not  to  the  momentary  value  of  the 
signal.  For  sinusoidal  signals,  for  example,  the  error  is 
usually  related  to  the  energy  per  period  of  the  oscillation. 

So  there  is  hope  that  our  choice  of  metric  is  resonable 
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and  our  conclusion  is  correct. 

9 Estimate  of  capacity  of  a communication  channel 

Our  talk  about  coding  has  been  centered  around  the  sound 
recording  problems.  But  the  estimate  presented  actually  re- 
lates to  arbitrary  signals  with  finite  spectrum  and  therefore 
can  be  used  in  other  applications.  For  example,  the  result 
may  be  looked  upon  as  the  estimate  of  the  capacity  of  a com- 
munication channel. 

Any  radio  communication  channel  uses  signals  with  finite 
spectrum  and  hence  can  be  interpreted  as  an  apparatus.  In 
this  case  we  may  use  the  parameters  a,  e and  6 to  charac- 
terize the  frequence  range  of  the  chennel,  nonlinear  dis- 
tortions of  the  chennel  and  the  level  of  channel  noise.  The 
entropy  H(T)  of  the  corresponding  class  characterizes 
the  information  content  of  the  signals  and  the  number 
H(c,e,6)  turns  out  to  be  equal  to  the  channel  capacity. 

The  fact  that  H does  not  essentially  depend  on  the 
parameter  6 when  6 is  sufficiently  small  with  respect 
to  e means  that  the  channel  capacity  does  not  depend  on 
the  level  of  channel  noise  as  long  as  the  noise  is  sufficient- 
ly small  with  respect  to  the  distortions. 

10  Estimate  of  derivatives  of  polynomials 

We  now  present  a result  obtained  while  proving  the  above 
theorem.  It  seems  to  be  of  some  interest  in  itself. 

Let  P(t)  be  a polynomial  of  degree  k and  M = 
max{  | P ( t ) | : -1  <_  t <_  1} . By  Bernstein's  theorem  the  deriva- 
tive of  P at  the  origin  satisfies  the  inequality 
| P ’ (0)  | <_  Mk.  It  is  well  known  that  this  estimate  is  sharp. 

V.  I.  Buslaev  [1]  has  found  another  form  of  estimating 
derivatives  as  follows: 

If  the  polynomial  P(t)  has  real  coefficients,  then 
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k-q 

| P * (0)  | <_  ApM,  where  p = 1 + q + I 

i=l 

A is  an  absolute  constant,  q the  number  of  the  zeros  of 
the  polynomial  P located  in  the  unit  disk  |t|  <_  1 and 
{r^}  are  the  roots  of  the  polynomial  located  outside  the  unit 
disk. 

Polynomials  which  arise  as  approximations  of  entire  func- 
have  widely  scattered  zeros.  For  such  polynomials  in  particu- 
lar, this  estimate  turns  out  to  be  much  more  effective  than 
Bernstein's  theorem.  However,  for  polynom-'  .s  with  complex 
coefficients  in  general  this  estimate  is  not  valid.  This 
can  be  seen  from  the  counter-example 

P(t)  = ( 1 + — 

\ /k 

In  [2]  Buslaev  presented  an  estimate  of  the  derivative 
of  a polynomial  in  terms  of  the  characteristic  p in  the 
integral  metrics  and  a generalization  of  Nikolski's  inequality 
relating  norms  of  a polynomial  in  various  metrics. 

11  Scheme  of  construction  of  nets 

Let  us  assume  that  a = tt  and  r = 1.  The  representation 
of  entire  functions  cited  in  section  1 asserts  the  poss- 
ibility of  approximating  a function  f e B uniformly  on 

IT 

[-T-1,  T+l]  for  large  T to  within  6 by  a function  of  the 
special  form  R(t)P(t).  Here,  the  factor  R(t)  is  a uni- 
versal function  not  depending  on  the  choice  of  the  function 
being  approximated,  and  the  factor  P(t),  which  depends  on 
the  choice  of  the  function  being  approximated,  is  a poly- 
nomial of  degree  n 2T  + o(T) . The  problem  is  thus  reduced 
to  the  calculation  of  the  quantity  H^  ^ for  the  class  of 
functions  of  the  form  R(t)P(t) . The  function  R(t)P(t) 
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is  uniquely  determined  (to  within  a constant  factor)  by  the 
collection  A = {a^.-.-.a^}  of  the  zeros  of  the  polynomial 
P(t).  If  two  collections  of  zeros  A and  A are  close  in 
a certain  sense,  then  the  function  R(t)P(t)  determined  by 
the  collection  A is  (e,0)-close  to  the  function  R(t)P(t) 
determined  by  the  collection  A.  The  accuracy  of  approxi- 
mations of  zeros  is  characterized  in  terms  of  the  density  of 
the  zeros.  A special  characteristic  p is  introduced  (see  sec- 
tion 10)  to  describe  the  density  of  a collection  of  zeros. 

The  most  essential  problem  in  computation  of  the  entropy 
is  the  finding  of  an  upper  bound.  A lower  estimate  can 
easily  be  obtained  from  the  estimate  of  the  e entropy  of 
in  the  uniform  metric  (cf.  [4]). 

As  we  see  from  the  above  scheme,  the  construction  is 
based  on  the  possibility  of  the  representation  of  polynomials 
in  one  variable  by  producing  linear  functions.  Polynomials 
in  several  variables  do  not  have  the  same  kind  of  representa- 
tions. This  is  the  main  obstacle  for  obtaining  an  asymptotic 
formula  for  the  (e, 6)-entropy  of  the  class  of  entire  func- 
tions in  several  variables. 
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ON  THE  CONVERGENCE  OF  SMOOTH  MONOSPLINES 
TO  POLYNOMIAL  MONOSPLINES 

R.  B.  Barrar  and  H.  L.  Loeb 

1 Introduction 

In  many  problems  in  numerical  analysis  optimal  mono- 
splines play  an  important  role,  [1,2, 5, 8, 9].  For  example  in 
several  applications  one  wants  to  characterize  the  monosplines 
of  least  norm  where  both  the  knots  and  coefficients  are  free 
parameters  but  the  multiplicities  are  fixed  odd  positive  inte- 
gers. We  phrase  this  monospline  problem  in  the  following  way. 
Let  K(x,y)  be  a given  real-valued  function  on  [0,1]  x [0,1]. 
Further  consider  the  fixed  positive  integers  {n,m^, . . . ,mt> 
where  m^  is  odd  and  is  called  a multiplicity.  We  can  state 
our  problem  as  follows: 

Problem  I.  Among  all  monosplines  of  the  form 

1 t mj-1  , . 

(1)  M(x)  = / K(x,y)dy  + E E a Ku'(x,?) 
o i=l  j=0  2 

n-1  (1) 

+ E a Ku;(x,0) 

j=0  2 

where  the  knots  satisfy  0 < < 1 and  the  coefficients 

a^.a^j  are  real  numbers,  find  the  M(x)  of  minimal  L^  norm 
on  [0,1]  (1  <_  p <_  “) . Here 


K(J)(x,C1) 


J 

-^rKCx.Ol  . 

* M 


If  we  allow  the  knots  in  Problem  I to  coalesce  we  have: 


Preceding  page  blank 
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Problem  II.  Among  all  monosplines  of  the  form 


1 f m'i-l  i(j)  n-1 


(2)  / K(x,y)dy  + £ £*  a KU'(x,{  ) + z a.K(:))(x,0) 

-i  -n  lj  1 • n J 


i=l  j=0 


j=o 


where  {m|,...,m^,}  are  any  set  of  positive  integers  so  that 
for  some  integers  0 = r^  < < ...  < r l+1  = t we  have 


i+1 

m!  = £ m. 

1 


(i  = 1, 


,t'), 


we  seek  the  monospline  of  smallest  norm  (1  < p i ®),  Re- 

call (m^,...,m  ) are  a fixed  set.  Note  that  the  class  of 
functions  defined  in  Problem  I is  a subset  of  the  functions 
considered  in  Problem  II. 

In  the  first  portion  of  this  paper  we  show  that  if 
K(x,y)  is  a smooth  extended  totally  positive  kernel  [6,  p . 49 ] 
then  Problem  II  always  has  a solution.  Further  we  prove  that 
any  solution  to  Problem  II  is  a solution  to  Problem  I and  we 
characterize  all  these  solutions. 

In  the  second  portion  of  the  paper  we  consider  the 
polynomial  monospline  kernel, 


K0(x,y)  = n(x  - y)"-1 


where 


n-1  / xn  1 x >.  0 

X+  | 0 x < O' 


It  is  well  known  that  KQ(x,y)  is  a totally  positive  kernel, 


[6,  p.511].  We  seek  a solution  to  Problem  I in  the  uniform 
norm  when 

plicity,  m^,  satisfies 


norm  when  KQ(x,y)  is  used  as  the  kernel  and  each  multi- 
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Since  KQ(x,y)  is  not  extended  totally  positive  and  also  not 
smooth  we  attack  this  problem  by  considering  for  each  e > 0 
the  smooth  kernel  K£(x,y)  obtained  from  KQ(x,y)  by  employ- 
ing the  Gaussian  Transform;  that  is, 

00 

K (x,y)  = / exp [-  ^~(x  - 5)2]K  (?,y)dC. 

Karlin  [6,  p.512]  has  shown  that  K^(x,y)  is  analytic  and 
extended  totally  positive.  Our  earlier  results  show  then  that 
we  have  at  least  one  solution  for  the  smooth  kernel  and  each 
solution,  M£(x),  has  the  property  that  there  are  alternation 
points, 

0 <.  x.^  < x2  < ...  < xd  <.  1 

where 

t 

d = n 4-  E (m.  + 1)  + 1 
i=l 

so  that 

(3)  Me(Xi)  = -Me(xi+1)  i = 1»  • • • > d-1 
IlMjl  - iM^Xj)! 

and  we  can  demonstrate  that  in  this  setting  x^  = 0 and 

x^  = 1.  Our  principal  result  is  that  for  some  sequence 

{e.HO,  M (x)  converges  uniformly  to  an  optimal  polynomial 
1 i 

mcnospline  M (x) . Since  M (x)  t 0,  it  follows  that  M (x) 
o o o 

also  satisfies  (3),  nontrivially. 
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2 Main  results 

We  will  state  our  main  theorems  without  detailed  proofs. 
The  proofs  will  appear  elsewhere. 

THEOREM  1.  Let  K(x,y)  be  a smooth  extended  totally  positive 
kernel.  Then  for  each  1 i p < «>;  Problem  II  has  a solution. 
Further,  each  solution  is  of  the  form  (1)  where 


(4)  a <0  (i  = 1 t). 

i.m,, 


In  the  uniform  case  each  best  approximation  satisfies  (3) . 

Proof . Using  the  techniques  of  [3]  one  can  show  that  a best 
approximation  exists  for  Problem  II.  In  fact,  the  only  new 
technique  needed  to  complete  the  proof  involves  a parametric 
method  for  separating  a set  of  knots  which  have  coalesced. 

The  concept  of  "extended  varisolvence"  [4]  and  Newton's  Iden- 
tity turn  out  to  be  quite  helpful  in  this  instance. 

For  each  e > 0,  let  M (x)  be  a best  monospline  approxi- 

E 

mation  to  Problem  I in  the  uniform  norm  for  the  kernel 

K^(x,y).  Further  let  M^^(x)  be  the  polynomial  monosplines 

which  is  obtained  from  M (x)  by  replacing  K (x,y)  by 

e e 

K (x,y).  By  going  to  a subsequence  we  can  assume  as  e + 0, 

° (0) 
the  knots  associated  with  each  M (x)  converge.  We  con- 

e 

sider  the  form, 


M(0)(x) 

e 


n-l 

xn  + I a X1  + S (x)  + S (x)  + S (x) 
^ n l,e  n, e g 


where  we  have  developed  M^^  00  in  terms  of  divided  dif- 
ferences. Here  S , (x)  involves  differences  up  to  order 

n-l,e 

n-l  of  all  knots  which  have  limit  points  in  (0,1). 

S (x)  consists  of  the  B splines  [6,  p.529]  for  these  "in- 

II)  8 

terior  knots."  S (x)  involves  all  the  terms  which  have  knots 
e 

which  converge  to  "0"  or  "l". 
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THEOREM  2.  For  0 < e <.  £q,  the  sequence, 
_ n-1  * 

{x  + Y a x + S n (x) }, 
iio  i,e  n-l,e 

is  uniformly  bounded. 


Proof.  The  fact  that  the  Gaussian  Transform  is  variation 
diminishing  [6,  p.20,103]  plays  a key  role  in  the  proof. 

THEOREM  3.  For  some  sequence  {e^}+0  M^^  (x)  converges 

uniformly  to  a M^(x)  of  the  form  (1).  further  Mq(x) 
satisfies  (3)  and  (4)  and  is  a solution  to  Problem  I in  the 
uniform  norm  for  the  kernel  KQ(x,y). 

Proof.  A key  ingredient  in  the  proof  is  a sharp  upper  bound 

on  the  number  of  alternations  Mq(x)  loses  because  of  the 

fact  that  the  knots  coalesce.  The  theory  of  B splines 

[6,  p.531]  and  Theorem  2 are  used  to  establish  this  bound. 

This  last  theorem  partially  answers  a question  posed  by 

Micchelli,  [8].  We  note  that  if  all  the  m^  = 1,  Mq(x)  is 

the  unique  minimizer,  [10],  and  we  find  in  this  situation  that 

any  sequence  {e.HO  has  the  property  M (x)  converges 
l ei 

uniformly  to  Mq  (not  just  a subsequence). 
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QUANTITATIVE  KOROVKIN  THEOREMS  FOR  L -SPACES 

P 

H.  Berens  and  R.  A.  DeVore 

It  is  the  aim  of  the  paper  to  formulate  and  prove  a quan- 
titative Korovkin  theorem  for  sequences  of  positive  linear  con- 
tractions on  the  space  Lp[a,b],l  < p < °°,  and  to  examine  in 
what  sense  the  given  estimates  are  best  possible. 

1 Introduction 

Korovkin 's famous  theorem  on  positive  linear  operators  can 
be  put  into  a quantitative  form. 

Let  c[a,b]  be  the  space  of  all  continuous  real-valued 
functions  on  the  compact  interval  [a,b]  endowed  with  the  su- 
premum  norm  II  * H oo » an<^  ^et  (L  : n = *»2***)  be  a sequence  of 
positive  linear  contractions  on  c[a,b],  One  version  of  Korov- 
kin's theorem  is  the  following: 


Let  4>^(x)  = x , i = 0,1,2,  and 


ln  = . max  II  Vi  ' M «■ 

i =0 , 1,2 


For  any  f e c[a,b] 

(1)  l|  Lnf  - f ||.  < C V||  f |L  + m2(f;Xn)},  n = 1,2,..., 

with  cu 2 ( f ) the  second  order  modulus  of  smoothness  of  f and 

C a constant  depending  only  on  the  interval. 

If,  in  particular,  L <p . = . , i = 0,1,  for  n = 1,2,.., 

2 nil 

then  >n(x)  * ^n^C*)  ~ — 0 and  t^ie  following  pointwise 

estimate  holds  true 

| Lnf(x)  - f (x)  | < C w2(f;Xn(x)), 


I 

! 
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with  C again  only  dependent  on  the  interval. 

For  translation  invariant  operators  on  C2n  Korovkin  him- 
self proved  a quantitative  theorem,  so  did  T.  Popoviciu  for 
Bernstein-type  operators  already  in  1950.  For  arbitrary  sequen- 
ces of  positive  linear  operators  R.  G.  Mamedov  and,  independent- 
ly but  later,  0.  Shisha  and  B.  Mond  gave  estimates  involving 
the  modulus  of  continuity  of  the  function.  The  theorem  stated 
above  is  due  to  G.  Freud  [5]  in  1968.  For  further  references 
see  [3]  . 


It  was  observed  almost  coincidentally  by  Ralf  L.  James,  by 
D.  E.  Wulbert  and  W.  Kitto,  and  by  G.  G.  Lorentz  and  one  of  the 
authors  [l]  that  the  set  already  forms  a "test  set" 

for  sequences  of  positive  linear  contractions  on  the  Lebesgue 
spaces  Lp[a,b],  I <.  p < 00 . In  view  of  the  quantitative  theo- 
rem stated  above,  it  is  natural  to  ask  whether  there  are  ana- 
logue estimates  for  the  L^-spaces  involving  the  modulus  of 
smoothness  and  the  order  of  approximation  of  the  functions 
(jw,  i =0,1. 


It  is  the  aim  of  the  paper  to  formulate  and  prove  such 
theorems  and  to  examine  in  what  sense  the  given  estimates  are 
best  possible. 


2 Main  Results 


For  an  f e 
continuity  by  w 
smoothness  by  w 


L 

P 
1 >P 


2,P 


[a,b] 

(f), 

(f); 


, 1 <.  p < °°,  we  denote  its  modulus  of 
and  its  second  order  modulus  of 
the  latter  being  defined  by 


C f ; 6 ) = sup  {J  |f (x+t)-2f(x)  + f (x-t) jPdx} '^P, 
^,P  0<t<6  bt 


where  = [a+t,b-t],  0 < t < (b-a)/2. 

Let  (L^:  n = 1,2,...)  be  a sequence  of  positive  linear 
contractions  on  Lp[a,b],  and  let 
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(2) 

X = { max 

n*P  i=0, 1 

THEOREM.  For  any 

(3) 

||l  f - f|| 

11  n 11  p 

If, 

in  addition, 

(4) 

llL  f - f || 

" n 11  p 

L 4>  • - <P  • } 

1 n i 1 11  p 


1/2 


f e Lp[a,b]  and  n = 1,2,... 


< c{x2/p  ||  fj 

n,p 


+ ( f ; X 1 ^P)  } 

P 2,p  n,p 


L 6 = <b  , then 

no  o 


< C'{X2/pw,  (f;b-a)  + w,  (f;X1/p)}, 

n, 1 1 ,p  2,p  n, 1 


The  constants  are  only  dependent  on  the  interval  [a,b]  and 
on  p . 

The  proof  depends  on  two  observations,  formulated  in  the 
following  two  lemmas.  The  first  one  is  that  for  L^-spaces 
Theorem  1 1 in  [l]  can  be  given  a quantitative  form. 

As  usual,  we  denote  by  f the  positive  part  of  the 

function  f. 


LEMMA  1 . For  any  f e L [a,b]  , 1 < p < 00 , and  n = 1,2,... 


(5)  ||  Lf  - f ||  < p 1 /p  ||  L f - f ||  '/p  ||f||  ' 1/p+  ||  L f-f  || 

n + + " p 11  n 11  p 11  11  p 11  n hp 

Proof.  Since 


L " fj  < 
n + + 1 1 p 


L f - (L  f)  I + 
'n+  n +Mp 


+ IKLnf>+  - fJ  p and 


(L  f)  - fl  < L f - f , 
n + +i!p  - "n  'ip’ 


estimate  (5)  is  proved  if  we  can  show  that 


<6>  HL„f+-  a„f)+l| ' < p||  v - f |1  p || f ||  p-1 . 

Observe  first  that  when  g,h  > 0,  ||g||  P + ||h||  P < ||  g + h|| 


Taking 

g = (Lnf)+  and  h 

= L f - 
n + 

<L„f),  > 0, 

it  follows 

that 

L f - (L  f)^  ||P  < 
n + n + 11  p - 

||  L f ||  P 
11  n + 11  p 

- II  «•„£>♦  1 

lp 

'p  • 

Since 

L is  contractive, 

n 
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HV*-  <L„»Jlps  llfJIPP-  ll'VU'p 

±p<IMp-  IUV)j|p)  jfjij-. 

from  which  inequality  (6)  easily  follows. 

REMARK.  For  p = 1 inequality  (5)  simply  reads 
llLnf+  “ f+l|  i ^ 2 l|Lnf  " f ||  ]•  Consequently,  if 

{yn  > 0,  n = 1,2,...}  is  any  (null)  sequence,  then  the  class 
of  functions 


{f  e Lja.b]  : ||Lnf  - f ||  , < CfYn>  n = 1,2,...} 

is  a vector  sublattice  in  Lj  [a,b] . This  does  not  hold  true 
for  1 < p < 00  as  the  examples  given  in  Sec.  3 will  show. 

For  1 < p < °°,  let 

w?  n[a>b]  = e b [a>b]  : f(x)  = It  (x-u)  g(u)du  a.e., 

g e L [a,b]  } . 

For  f e W„  [a,b] , f and  f'  are  absolutely  continuous 
on  [a,b]  and  the  associated  function  g equals  f".  In 
other  words,  ^[a.b]  equals  [a,b]  modulo  the  linear 

functions.  Furthermore,  let  V„  . [a,b]  denote  the  relative 
completion  of  j [a,b] , i.e., 

V2  j [a,b]  = {f  e L | [a , b]  : f(x)  = jb (x-u)+dp (u)  a.e., 

U a function  of  bounded  variation  on  [a,b]}. 

Clearly,  for  such  an  f,  we  have  f'  = p. 

LEMMA  2.  For  f e [a,b]  , 1 < p < °°, 

(7)  II V £l|p  <-  CX^Jll  £"  ||  p.  n = 1,2,... 

More  generally,  for  f e V„  [a,b] 

^ 9 ' 


(8) 


L f - f 
1 n 


< C*2/p  [var  f’lb  , n = 1,2,... 
n,p  L Ja  ’ 
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The  constant  C depends  only  on  the  space  L^ia.b  . 

Proof . If  f e ^[a,bJ,  it  follows  from  its  very  definition 
that 


L f - f 
n " p 


< r ||L  ((.-u)  ; .)  - (•  -u)  ||  |f"(u)  jdu. 

' d ll  t t n 


Setting 

(9)  an  n = SUp  llLn(("u)  + ; •>“<*  -u)+H  D’  n = 1,2,... 
a<u<b  v 


we  obtain 


| L f - f ||  < a2  llf"  ||  . < (b-a) 

n 11  p n,p  11  11  1 — 


1 ' /p  a2  l|  f " ||  . 

n,p  11  11  p 


1/P 


By  Lemma  1,  a < const.  \ y , this  proves  (7).  The  proof 

n,p  “ n,p  p K 

of  (8)  is  similar. 

REMARK.  V„  . fa,bj  forms  a normed  linear  space  under  the  norm 

^ b ^ 9 ^ 

[Var  f']  . Observe  that  the  approximation  behavior  of  a se- 
quence (L^)  on  the  unit  ball  of  j [a,b]  is  determined  by 

the  behavior  of  the  extreme  points  of  the  ball.  Here  the  ex- 
tremal functions  are  given  by  e^Cx)  = ±(x-u)+,  a < u < b. 
Hence 

sup{  1 1 f — f II  i : f t ^ | and  [Var  f * ] < 1} 

= sup  ||L  ((.-u)  + ;0  - (•  -u)J  . 
a<u<b 


By  Lemma  1,  || Ln ( (•  -u)  + ; •)  - (-  -u ) + | ] < 2 [| L^ ( (•  -u)  , • ) - 

~ (•  ~u)  ||  | £ 2 ( 1 +max(  | a | , | b | ))  i2  | . This  shows  clearly  that 
for  functions  f e j [a,b]  the  approximation  behavior  of 
L f towards  f is  determined  by  that  for 


b and  d> , 
Ko 


Ilf  II 

(10) 


Introducing  on  L^  [a,b] , 1 < p < <*>,  the  norm 

f 1 1 + ||  f"  ||  , it  follows  from  Lemma  2 that 

ii  p ii  u p> 

L f - f ||  < Ca2  ||f||„  < C'X2/p  ||  f ||  _ , n = 1,2,.. 

n 1 p n,p  11  11  2 ,p  n,p  11  11  2,p’  ’ ’ 

(2). 


2»P 


Proof  of  the  theorem.  Let  K^(t;f)  :=  Kp(t;f;LpL  ),  t > 0, 
denote  the  Peetre  K-function  norm  of  the  function 
f e L [a , b]  , i . e . , 
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K (t;f)  = inf  { ||  f-g ||  + t ||g| 


2»P 


: g c iV  [a , bj  } . 


By  a well-known  result  on  moduli  of  smoothness,  see  H.  Johnen 

L71  or  0]  > 


Kp(t  ;t)X  min(  1 , t ) ||  f ||  p + w2  p ( f ; t ) . 

(2) 


+ ) 


Hence  for  any  f e L^[a,b]  and  any  g e ^[a,b] 


IV  - f llp  i UVf-g)  " (f-g>llp  + 


!Lng  - g| 


< 2 f - g 


+ Ca 


I 8 I 


p n,p  2 ,p ' 

(2) 


Taking  the  infimum  over  all  g e Lp  7[a,b], 

(11)  ||Lnf  - f ||  p < max(2,C)  Kp(a2^;f) 

< C'{a2  ||f||  + to-  (f;a  )}. 

n,p  11  11  p 2 ,p  n,p' 

As  was  remarked  earlier,  a < const.  a'^P.  Putting  this  in 

n,p  “ n,p 

our  last  estimates  gives  the  theorem. 

To  prove  the  estimate  (4),  let  us  remark  that,  since 
Vo  S V I Ln ( (*  _u V 5 x)  * (x-u)+l  i 2 (b-a)  a.e.,  and, 

< 22_1/p(b-a)1_,/p  X2/P. 


consequently,  a 
2 


where 


n,p  - ' ' n, 1 ’ 

A“  | * ||Ln<J>|  “ | • Furthermore,  for  any  constant-valued 

function,  say,  c, 

2 


L f - f < C {a' 

n r-%  — r 


inf 

c 


If  - c||  + to  (f;a  -.)!• 

P 2 ,p  n,p 


Since  inf{  ||  f - c[|  p:  c e R)  <_  const,  aij  p(f;b-a)  the  last 
estimate  implies  inequality  (4). 


REMARK.  A more  careful  analysis  of  the  constants  appearing  in 
the  proof  will  show  that  constants  C and  C'  can  be  chosen 
which  only  depend  on  the  interval  [a,b]  but  not  on  the  index 
P* 


t)  f(t)  X g(t)  means  f(t)  = 0(g(t))  and  g(t)  - 0(f(t)). 
A similar  notion  is  used  for  sequences. 
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3 Examples 


We  would  now  like  to  see  what  the  theorem  gives  by  study- 
ing a few  typical  examples.  As  the  remark  following  Lemma  2 
indicates  the  techniques  used  in  the  proof  are  techniques  that 
live  in  Lj.  In  most  instances  inequality  (3)  can  be  improved 
by  using  the  estimate  (11)  in  terms  of  ^ (Example  1)  or 

an  interpolation  argument  (Example  2).  However,  the  Examples 
4 and  5 show  that  in  general  the  estimates  (3)  and  (4)  cannot 
be  improved. 

1.  On  L [o,  lj  , 1 < p<  °°,  we  define  for  each  n *=  1,2,... 
the  transformation  f •+  E^f  as  the  conditional  expectation  of 
f with  respect  to  the  field  generated  by  the  set 

;fJL.  Kli)  . k 

Hn+I  » n+ 1 ' ‘ 


Enf(x)  = (n+1) 


By  definition,  (E 
linear  contractions,  E 


n,p 

i 

n,  p 


• • • • 

,n} , i.e 

• » 

k+1 

n+ 1 
k 

f (u)du. 

x e [ 

• k 

n+1  ’ 

k+1 
n+ 1 

•). 

n+1 

n * 

r\ 

1,2,...) 

is  a 

sequence 

of  positive 

E2  = 

E and 

E <)> 

= d>  . 

We 

have 

n 

n 

no 

To 

I)"' 

. One  verifies 

i easi! 

Ly, 

that 

j(P*i>-,/p 

X and,  consequently,  by  (11)  for  any  f e L [0,l] 
n,p 

lE„<f>  - £U  p * C<5“l,p<fi'>  * “2,p<£:  n''/2))- 


n 


1,2,... 


2.  Bernstein-Kantorovif  polynomials.  For  f e L [0,  l] 
and  n - 1,2,...  let 

* B„E„£<*>  - B„,P(x),  F(x)  - J*  £(u)du, 

where  B is  the  well-known  Bernstein  operator.  The  P are 
n r n 

positive  linear  contractions  on  L [0,1]  with  P * = <b  . 

r 2 _ I P L J no  ° 

Here  X ^ (n+1)  . For  p ■ 1 this  leads  to 

n,P 
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02)  l|Pnf  - *11  p £ %<{  * “2,p(f‘"‘‘/2))- 

f e L [o,  ij  , n = 1 , 2 , . . . 


By  Freud's  estimate  (I),  this  inequality  also  holds  true  for 
p = <*>.  An  interpolation  argument  finally  establishes  (12)  for 
all  p , 1 < p i 00 . 

The  inequality  improves  estimates  given  by  R.  Bojanic 

and  0.  Shisha  [2] . For  functions  f of  bounded  varation, 

. . -1/2 
(12)  gives  an  order  of  approximation  n . In  this  case,  a 

stronger  estimate  was  given  by  W.  Hoeffding  [6]  who  included 

pointwise  behavior  at  the  endpoints. 

3.  For  each  t,  0 < t < 1/2  we  define  on  Lp[0,l], 

1 i p < 00 


Ltf (x) 


/ y(f(x+t)  + f (x) ) , 0 < x < t, 

y(f(x+t)  + f(x-t)),  t < x < 1 — t , 

-j(f(x)  + f(x-t))  , 1-t  < X < 1. 


Clearly,  defines  a positive  linear  contraction  on 

Lp[0,lj,  and  = <J>q . Also  A2  ^ t*  + *^.  For  any 

f e Lj[o,l]  and  any  0 < 6 < 1/2 

, ( f ; 6 ) < sup  ||  Lf  - f ||  . < C.{62u).  (f;l)+(1)  (f ; 6 ) > 

* 0<t<6  ’ Z’ 

with  the  right  hand  estimate  derived  from  Theorem  1.  It  fol- 
lows by  the  remark  after  Lemma  I that  the  Lipschitz  classes 

Lip*(a;L,  [0,l])=  {f  e L,[o,l]  : u>2  j (f  ;6)  < Cf6a, 

0 < 6 < 1/2,  0 < a <.  2}, 

are  vector  sublattices  in  Lj[o,l].  This  statement  is  trivial 
for  the  class  Lip(a;L  [0,l]),  0 < a < I,  for  all  I < p < °°, 
but  for  Lip*(a;Lj [0,  1^) , 0 < a < 2,  it  seems  not  so  obvious. 
For  Lp[0*ljt  1 < p < 00 , the  latter  is  obviously  wrong. 
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4.  This  example  will  show  that  (4)  cannot  be  improved. 
For  f e Lp[-I,l]  , I < p < ®,  set 


L f(x)  = 


f(x)  , |x|  < i - - , 

n 

f(-x),  1~  < I x I < I.  n = 1,2,... 


forms  a positive  linear  contraction  on  L [-l,l]  and 
L <t>  = <p  . Furthermore,  X^  X n 


n o 


n,p 


X“'T.  The  function 
n,  1 


f (x)  = x belongs  to  W„  [-1,1],  but  II L f -f 
o + 6 2,pL  J’  "noo 


u -1/p 


5.  Finally,  consider  on  L^[-l,l]  the  sequence  of  opera- 


tors 


L f(x)  = 

n 1 +2_n 


f(x) 


1 ,1 


, 2 n < | x | < 1 , 


j ( ' f(x)dx  , 0 £ | x | < 2 n n = 1,2,.. 

-1 

One  verifies  easily  that  defines  a positive  linear  con- 

traction on  Lp  [“'  » 0 » • i P £.  “•  Also  (1+2  n)Ln<J>o(x)  = 4>  (x) 
and 

-n 


r 4>j  (x),  2 < | x | < 1 , 


( 1 +2~n)Ln4>  | (x)  = { 


0 


x < 2 


-n 


which  implies  that  X^  X 2 n. 

n,P  -n 

For  4>  2 » we  have  (1+2  ^^^(x)  = ^(x),  2 “ < |x|  <.  1, 

and  = 1/3  for  0 < |x|  < 2 n.  Hence 


~n 


, ,-n/p 


lLn*2  - *2'1  p ^ 1 


, l£P 


proving  that  for  sequences  of  positive  linear  contractions  on 
Lp[-l,l],  1 <.  p < °°,  in  general,  the  order  of  convergence 
given  by  (3)  cannot  be  improved. 
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REFLEXIVITY  AND  THE  EXISTENCE  OF  BEST  APPROXIMATIONS 


J8rg  Blatter 

A (nonempty)  subset  A of  a (real  or  complex)  normed 
linear  space  X is  called  proximinal  (or  an  existence  subset) 
if  every  element  of  X has  at  least  one  best  approximation  in 
A.  Problems  in  best  approximation  are  especially  simple  in  re- 
flexive Banach  spaces  because  in  such  spaces  every  closed  con- 
vex subset  is  proximinal  (this  is  a simple  consequence  of  the 
weak  compactness  of  the  unit  ball) . It  is  thus  natural  to  ask 
if  there  is  a broader  class  of  normed  linear  spaces  in  which 
every  closed  convex  subset  is  proximinal.  For  Banach  spaces 
this  question  is  answered  negatively  by  the  following  theorem 
of  R.  C.  James  [1]:  A Banach  space  X is  reflexive  if  and 

only  if  every  continuous  linear  functional  on  X attains  its 
norm  on  the  unit  ball.  Namely,  (as  observed  by  James  [3, 
p.253])  a continuous  linear  functional  on  a normed  linear  space 
attains  its  norm  on  the  unit  ball  if  and  only  if  its  kernel 
is  proximinal.  In  1966,  F.  Deutsch,  in  a letter  to  James, 
asked  whether  a normed  linear  space  in  which  every  closed  con- 
vex subset  is  proximinal  must  be  complete  . After  a couple  of  weeks 
James  provided  a partial  answer  by  exhibiting  an  incomplete 
normed  linear  space  in  which  every  closed  hyperplane  is  proxi- 
minal (much  later  this  result  was  published  in  [2]).  Despite 
this  indication  of  a negative  answer,  Deutsch' s question  does 
indeed  have  a positive  answer: 

THEOREM.  A normed  linear  space  in  which  every  closed  convex 
subset  is  proximinal  is  complete. 

Proof . Let  X be  an  incomplete  normed  linear  space.  We 
shall  exhibit  a closed  convex  subset  K of  X in  which  0 
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has  no  best  approximation.  Let  x be  a point  of  norm  1 in 
the  completion  of  X which  does  not  belong  to  X,  and  let 
x'  e X'  be  of  norm  1 and  such  that  x'x  = 1.  One  easily  con- 
structs a sequence  {x  } in  X which  converges  to  £ and 

n new 

is  such  that  x'x  = 1 + 1/n  for  all  n.  Set  K = co({x  :n  e IN)) 
n n 

(=  closed  convex  hull  of  {x  :n  e IN}) ; x'  is  real  on  K and 

n 

for  each  x e K,  x'x  >_  1.  Since  ||x'||  = 1,  ||x||  > 1 for  all 
x e K.  limjlxjl  = ||k||  = 1 and  so  inf {|| x 1 1: x e K}  = 1,  i.e., 
the  distance  of  0 to  K is  1.  We  shall  thus  be  done  once  it 
is  shown  that  x'x  > 1 for  x e K.  Accordingly,  let  us  suppose 
that  there  is  an  x e K such  that  x'x  = 1.  We  have  obviously 
^ co({x,  :k  > n})  = 0 and  so  there  is  an  N e IN  such  that 

new  k 

x t co({x,  :k  > N)).  Let  {y  } be  a sequence  in 
k Jn  neW 

co({xn:n  e IN})  (=  convex  hull  of  ^xn:  new})  which  converges 

to  x.  Set  A = co({x^:k  _<  N})  and  B = co^x^k  > N}).  We 

can  find  a e A,  b e B and  nonnegative  reals  a and  & 
n n & n n 

such  that  y = a a + B b and  a + B =1.  For  a e A, 
n n n n n n n 

x'a  >.  1 + 1/N  and  for  b e B,  x'b  >,  1.  Thus 


x'y 


n 


a x'a  + B x'b  > a (1  + rr)  + B 
n n n n — n N n 


a . 
n 


Since  {x'y  } converges  to  x'x  = 1,  and  since 
n ne|N 

1 + ^ a ll 
N n 


for  all  n,  {a  } must  converge  to  0,  Since  A is  bounded, 
n ne{N 

{a  a } converges  to  0,  i.e.,  (B  b } converges  to  x. 

n n neW  n n neJN 

Since  { B } converges  to  1,  {b  } _ also  converges  to  x, 

n neJN  n neJN 

i.e.,  x e ^({x^ik  > N}) — a contradiction. 

This  theorem,  of  course,  combines  with  James'  theorem  to 
yield  the  following  characterization  of  reflexivity:  a normed 

linear  space  is  reflexive  if  and  only  if  each  of  its  closed 
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convex  subsets  Is  proximinal.  Because  James’  theorem  is 
comparatively  deep,  it  would  be  nice  to  have  a simple  direct 
proof  of  this  characterization. 
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APPROXIMATE  FUNCTIONAL  COMPLEXITY 


R.  Creighton  Buck 

Necessary  conditions  are  obtained  for  a function  F 
either  to  be  representable  in  the  form  F(x,y,z)  = f(<J>(x,y),z) 
with  continuous, or  to  be  uniformably  approximable  by 
such  functions. 

The  works  of  Vitushkin,  Arnold,  Kolmogorov,  and  others 
have  answered  many  of  the  basic  questions  in  the  theory  of 
functional  representation  and  complexity  ([1],  [6],  [8],  [9]). 
However,  these  methods  often  depend  on  category  arguments 
that  do  not  seem  to  be  suitable  for  deciding  the  status  of 
an  individual  function  (see  [A],  [5],  but  also  [3]).  It  is 
of  interest,  therefore,  to  find  criteria  which  help  to  decide 
if  a specific  function  either  belongs  to,  or  can  be  appro- 
ximated by,  a given  class  of  simple  functions  of  a prescribed 
superposition  format  (see  [2]).  In  this  paper  we  illustrate 
this,  using  as  a test  class  the  functions  of  the  form 
F(x,y,  z)  = f (<J>(x,y),z)  . 

We  write  F°(0)  for  the  class  of  such  F which  have 
this  representation  on  the  region  0,  with  component  functions 
f and  d>  of  class  Cn.  If  n 2,  a characterization  is 
immediate. 

THEOREM  1.  If  F e F 2(0),  then  F satisfies  in  0 
the  nonlinear  equation 


(1) 


F F 
x yz 


F F 
y xz 


0 


Preceding  page  blank 
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Conversely,  If  F satisfies  this  equation  in  0 , then  F 

has  the  desired  local  representation  in  the  set  0 - V, 

where  T is  the  set  where  F = F =0. 

x y 

One  would  hope  for  a similar  characterization  with  only 

continuity  requirements  on  f and  <j>,  perhaps  a weak  form 

of  (1).  As  a first  step  toward  this,  observe  that  Theorem  1 

can  be  given  a different  formulation.  For  any  c,  consider 

the  planar  mapping  T defined  by 

(u  = F(x,  y , c) 

v = Fz(x,y,c) 

Then,  equation  (1)  can  be  restated  as  3 (u, v)/3 (x, y)  = 0. 

2 

Thus,  F e F if  the  mapping  T is  locally  singular,  for 
each  c.  With  this,  one  is  led  to  a characterization, 

based  on  the  observation  that  Fz  is  approximately 
F(x,y,z+h)  - F(x,y, z) . 

THEOREM  2.  Define  a planar  mapping  by 

{u  = FCx.y.c.) 
v = F(x,y,c2> 

Then  3 (u, v) /3 (x, y)  = 0 for  all  (x,y,c.)  in  0 if 

2_  1 
F e F ( 0 ) . Conversely,  if  this  holds,  F is  locally  of 

the  desired  form  in  0 - T . 

This  is  not  yet  the  theorem  sought,  since  it  requires  C^ 
components,  but  it  is  suggestive.  It  is  helpful,  before  going 
further,  to  recast  the  original  problem.  Given  F:  X * Y x Z 

-+  R,  of  the  form  f (<j>  (x,y) , z) , an  equivalent  viewpoint  is  to 
consider  F as  a mapping  from  X x Y into  C[Z],  which  may 
now  be  replaced  by  any  Banach  space  E . The  representation 
problem  is  now  converted  into  a factoring  problem;  given  F, 
find  <j>  and  f so  that  the  following  diagram  commutes  : 
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(4) 


2 

R > R 


If  no  restrictions  are  placed  on  <J>,  this  may  be  solved  for 

2 

arbitrary  F by  taking  <p  as  any  1:1  map  from  R into 
R.  If  we  ask  that  <{>  be  continuous,  F is  severely  re- 
stricted . 


THEOREM  3.  I_f  F has  the  form  shown  in  (4)  and  <j>  is  con- 
tinuous, then  F is  locally  singular  in  the  sense  that  on 
any  open  set,  either  F is  constant  or  F has  a noncount- 
able  number  of  distinct  noncountable  level  sets  (sets  of 
constancy) . 

This  elementary  result  is  sufficient  to  show  that  speci- 
fic functions  do  not  belong  to  the  class  F^  on  any  open 
set  (e.g.,  xy  + yz  + xz) . 

This  result  can  now  be  extended  to  the  uniform  closure 
r° 

of  r by  means  of  a special  property  of  continuous  func- 
2 

tions  <j>:  R -*•  R. 

2 

LEMMA . Let  S be  the  unit  square  in  R . There  is  a con- 
stant C such  that  for  every  continuous  real  function  <f>  on 
S,  and  0 < 6 < 1,  there  is  a level  set  E for  tp  such 
that  N(6 , E)  > C/6. 

Here,  N(6,E)  is  a standard  "size"  function  for  sub- 
sets of  a metric  space,  the  minimum  number  of  sets  required 
for  a 6 cover  for  E.  Using  this  Lemma,  one  may  obtain  a 
criterion  for  approximate  representability . 

THEOREM  4.  If  F : S + E is  the  uniform  limit  on  S of 

~ ° 2 ~ 

functions  F of  the  form  f (4>(x,y) ) where  <j>:  R ■+  R is 
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continuous,  then  must  have  level  sets  of  arbitrarily  large 
finite  cardinal. 

This  shows  immediately  that  functions  such  as 
2 2 2 

xy+yz+xz  orxy  + yz  + zx  do  not  lie  in  the  uniform 
closure  of  F®  on  any  open  set.  Indeed,  a refinement  of 
this  approach  allows  one  to  obtain  an  estimate  for  the  dis- 
tance from  Fq  to  F®  when  Fq  is  sufficiently  smooth. 
These  results  suggest  the  following  conjectures:  (i)  if 

00  ro 

F e C , and  F lies  in  the  uniform  closure  of  r , then 
o o 

F satisfies  the  differential  equation  (1);  (ii)  if  F e C°°, 
o q o 

then  the  uniform  norm  distance  between  F and  F (S)  can 

o 

be  estimated  by  the  minimum  of  |H  (F  ) I where  H is  the 

o 

differential  operator  in  (1). 
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EEST  UNIFORM  HARMONIC  APPROXIMATION 


H.  G.  Burchard 

For  a given  continuous  real  function  on  the  closed  unit 
disc,  we  characterize  its  best  uniform  harmonic  approximation, 
if  this  exists. 

Let  A be  the  closed  unit  disc  in  the  complex  plane  C. 

C (A)  is  the  Banach  space  of  real  continuous  functions  on  A, 
with  the  uniform  norm,  H the  closure  in  C(A)  of  the  real 
harmonic  functions  on  C restricted  to  A,  i.e.,  H consists 
of  the  Poisson  integrals  of  continuous  data  on  bdy(A)  = T. 

If  f z C(A),  then  f need  not  have  a best  approximation 

oc 

in  H.  However,  enlarge  H to  the  set  H of  Poisson  inte- 

CO  CO 

grals  of  L data  on  T.  Then  there  exists  h z H and  a 
minimizing  sequence  (h^)  in  H which  converges  uniformly  on 
compact  subsets  of  int(A)  to  h,  a best  approximation  from 

CO 

H to  f (in  the  space  of  bounded  functions  on  A) . 

DEFINITION  1.  For  a compact  set  K d C let  ft (K)  b£  the  un- 
bounded component  of  C - K and  K'  = C - ft (K) . 

K'  is  the  polynomially  convex  hull  of  K. 

THEOREM  1.  If  feC(A),geH  let 

K±  = (x  c A:  f(x)  - g (x)  = ±|  f - g 

Then  g _is_  ,a  best  uniform  approximation  to  f from  H if 
and  only  if  the  following  condition  holds : 

(1)  I_t  i_s  not  true  that  both  K+  CL  ft(K  ) and  K_CI  ft(K+) . 

One  can  also  express  (1)  by  saying  K+  meets  K'  or  K_ 
meets  K^_. 

Preceding  page  blank 


309 


H.  G.  BURCHARD 


A remark  of  R.  C.  Buck  in  [5]  suggested  to  the  author 
that  there  should  be  such  a theorem. 

The  proof  of  Theorem  1 is  based  on  one  half  of  the  follow- 
ing known  result,  cf_.  [8], 

THEOREM  2 . Given  is  a_  compact  Hausdorf f space  X and  a_  linear 

subspace  A o£  (real)  C(X) . Denote  by  A-*-  the  annihilator 

of  A _in  C(X)*  (AX  = set  of  Radon  measures  orthogonal  to 
A),  _If_  f e C(X)  and  g e A let 

K+  = (x  e X:  f (x)  - g (x)  = ± j 1 f - g c^}. 

Then  g _is_  a.  best  (uniform)  approximation  to  f from  A if 
and  only  if  one  of  the  following  two  equivalent  conditions 
holds : 

(2)  ^ h c A,  h(K+)  > 0 > h(K  ) (elementwise) ; 

(3)  3 u e A1",  u + 0,  supp(y“)  c.  . 


We  also  require  the  following  intuitive  result. 


LEMMA  1.  Suppose  are  compact  subsets  of  the  plane, 

KXC  H(K2),  l^CflCKj.).  Let  KQ  = KjU  K . Then  n K*  - 0 
and  K‘  = K'u  K*. 


Proof . One  first  shows 
cases.  This  shown,  let 


K'  ° K2 


f<l  U f<2  = IR 


and  each  is  a connected  open  set. 


0 by  distinguishing  several 
Then 

It 


no^),  = fi(K2). 


follows  that  A 
Theorem  there  is  an  exact  sequence 

0 


0^  n f^2  is  connected:  By  the  Mayer-Vietoris 


Hj.  OR2) 


H0(A) 


W ® H0(t!2) 


H0®2) 


using  singular  homology  in  the  augmented  case  [2].  But 
and  ft2  are  path  connected,  hence  Hq(Q^)  = = 0,  so 

Hq(A)  = 0,  i.e.  0^0  ^2  ■ C - (K^ U K')  is  path  connected. 
From  this  one  deduces  K"  = U K'. 


310 


harmonic  approximation 


The  following  result  now  implies  Theorem  1. 

PROPOSITION  1.  Let  K^,  b£  compact  subsets  of  A.  Then 

(1)  and  (2),  with  A = H,  are  equivalent . 

Proof . Assume  (1)  fails.  The  conclusions  of  Lemma  1 hold  and 
we  let  u = 1 on  K',  u **  -1  on  K^.  Then  u is  continuous 
on  Kq,  which  is  compact  with  connected  complement.  By  the 
Walsh-Lebesgue  Lemma,  we  can  find  h e H such  that  jh  - uj  < 
y on  bdy(Kg).  By  the  maximum  principle,  h(K^)  > 0 > hCK^)* 
so  (2)  fails  (or  use  Runge's  theorem  in  place  of  the  Walsh- 
Lebesgue  Lemma,  [s.  Minsker's  remark]).  Now  suppose  (2)  fails, 
A - H,  and  h e H such  that  h(K^)  > 0 > hC^).  If  x t K~ 
then  either  x e K^,  h(x)  <0,  so  x f.  K^,  or  x e fi,  a 
bounded  component  of  C - l^.  But  bdy(ft) C K^,  so  h(x)  < 0 
by  the  maximum  principle.  Hence  x t K^.  This  shows 
C and  thus  also  C i.e.,  (1)  fails. 

The  same  method  of  proof  gives  the  analogous  result  for 
harmonic  functions  on  the  unitball  of  R°;  for  generalizations 
of  the  Walsh-Lebesgue  Lemma  (or  Runge's  theorem)  cf_.  [4]. 

PROBLEM.  Replace  harmonic  functions  by  biharmonic  functions 
to  obtain  a subspace  BH  of  C(A).  SH  is  the  convex  cone  of 
continuous  subharmonic  functions  on  A.  Prove  results  similar 
to  Theorem  1 for  BH  and  SH.  The  latter  case  is  analogous 
to  problems  on  the  line  solved  in  [6]. 

The  known  generalizations  of  the  Walsh-Lebesgue  Lemma  to 
solutions  of  elliptic  equations  [4]  do  not  appear  to  be  suf- 
ficient to  solve  the  problem  for  BH.  E.g.,  one  can  show  that 
BH|  is  dense  in  C(K)  if  K is  the  union  of  two  concentric 
circles.  One  of  these  may  be  deformed  into  a rectifiable 
Jordan  curve. 

It  is  of  interest  to  characterize  further  the  sets  K+ 

that 
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can  occur  in  (2)  or  (3).  Let 

~ (K  C X:  K is  closed,  3u  e A^,  y + 0,  supp  y+c  K} , 

3ye  a_L"»  V * 0,  supp  y+C  K,  K n supp  y 

= 0}. 


IL  = (K  e l2:  3 C K,  K e Z }, 


Then  ZQ  C Z < 


'2* 


PROPOSITION  2.  For  A - H,  and  K CZ  A,  K closed,  one  has 
K e = Ei  iff  K _is  not  a proper  subset  of  T;  K e Z^  iff 
K n T = 0 or  K^K". 

One  can  prove  a quite  general  theorem  characterizing 
using  Choquet  Theory.  A is  as  above,  but  in  addition  1 c A 


and 

A 

separates  the  points  of 

X. 

THEOREM 

3.  For  a closed  KC  X 

each  of  tne  following  is 

equivalent  with  K t Z„: 

(4) 

If 

.X  + 

M e A , supp  p C K, 

then  y = 0. 

(5) 

a) 

If  y C AX,  supp  y c 

K,  then  supp  y d K. 

b) 

A|  is  dense  in  C(K). 

(6) 

If 

y c M*(X),  supp  y c K, 

then  y is  the  unique  re- 

presenting  measure  of  y|  . 

(7) 

If 

y e M+(X) , supp  y c K, 

then  V f in  C(X) 

ffu 

= sup{/gy : g e A,  g < 

f }. 

(8) 

a) 

The  barycenters  (in  the 

state  space  S of  A)  of 

probability  measures  carried  by  K form  an  extremal 

Bauer  simplex  (extreme  boundary  is  closed  [l]) 

bary  (K)  d S (we  let  canonically  KCXCZS, 
A C A(S) ) . 


b)  Va 

mass) . 


e bary(K),  then  x e K (e  = unit  point 


312 


HARMONIC  APPROXIMATION 


c)  {e^:  x e K}  = extr (bary (K) ) a extr(S). 

(9)  (5)b)  holds  and  with  T = bary(K) 

V a,  6,  0 < a < 8,  V open  V D K,  3 f^,  . . . , f^eA 

0 > f , V x c T max  f (x)  > -a  > -6  > f (X  - V) . 

1 l<i<n  J 

s s 

If  K = {x},  then  K t if  x e S^CX),  the  Choquet 

boundary  of  A.  The  proof  of  Theorem  3 mostly  parallels  simi 
lar  proofs  for  K = {x},  c£.  [3].  Much  of  Theorem  3 can  be 
extended  to  arbitrary  Borel  subsets  K of  X. 

For  A = BH  one  might  hope  that  E_  = E1  , too.  Note 

that  p c BH  iff  y e H and  r y e H1-  [7].  Also,  9^BH  = A 
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FRACTIONAL  CHEBYSHEV  OPERATIONAL  CALCULUS 
AND  BEST  ALGEBRAIC  APPROXIMATION 

P.L.  Butzer  and  R.L.  Stens 

A basic  difficulty  in  the  fundamental  theorem  of  best 
approximation  of  functions  defined  on  a finite  interval  [-1,1]  , 
say,  by  algebraic  polynomials,  is  due  to  the  fact  that  approx- 
imation is  more  efficient  near  the  end  points  -1 ,+l  than  within 
the  interval.  What  is  moreover  lacking  seems  to  be  a systematic 
approach,  made  possible  in  the  trigonometric  case  by  the  finite 
Fourier  transform.  To  overcome  these  and  other  difficulties,  the 
purpose  of  this  paper  is  to  present  a new  approach  to  this  sub- 
ject based  upon  a Chebyshev  operational  calculus. 

1 Introduction 

The  Chebyshev  transform  of  f€X,  where  X— C[  —1,1] =C  or 
LP(-I,1)=LP,  Kp<°o,  with  norms 

II  fll  - sup  | f (x)  | , II  fll  - (^  / ' | f (u) |P  w(u)du}  1 ^P 
C x€l  —1,1]  p * -1 

2-1/2 

respectively,  where  w(x)  ■ (1-x  ) , is  defined  by 

r<k>  * “ /I,  f (u)Tk(u)w(u)du  (k£P) , 

T^(x)  * cos  (k  arccos  x) , x€J  —1,1]  , being  the  Chebyshev  polynomial 
of  degree  k.  The  associated  translation  operator  for  f£X  is  de- 
fined for  x,h€[-l,l]  by 

(Thf)(x)  = j(f  (xh  + \/(l-x2)  (l-h^j)  +f(xh-\/(l-x2)(l-h2))} 

and  the  convolution  product  of  f€X,  g£L^(-l,l)  by 

(f*g)(x)  - ^ (T^f ) (u)g(u)w(u)du  (x€[  -1,1]). 

2 Fractional  Chebyshev  Derivatives  and  Integrals 
Defining  the  (right)  difference  of  f6X  of  arbitrary  order 
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a>0  with  respect  to  the  increment  h€|  —1,1]  by 

(A^f)(x)  :=  (-1)U]  [”=0(-l)j(“)(Tjf)(x)  (x€[-lfl]), 

where  [ otj  is  the  largest  integer  <a,  and  the  strong  Chebyshev 
derivative  of  fGX  of  order  a as  the  function  g£X  for  which 


lim 
h-H  - 


.a  “ 8"  X 


0; 


( 1 — h) 

=g  if 

f of  order  a>0  is  defined  by 


. ct  . , cx 

we  write  D f=g  if  such  g exists.  The  Chebyshev  integral  I f of 


(Iaf)(x) 


(f  * 4>a)  (x) 

i 


(x€J  —1,1)), 


where  is  the  L^-function  defined  uniquely  by 


*;<k>  - 


(-0 
0 


[ a] k~2a 


k€N 
k=0 . 


THEOREM  1 . The  following  assertions  are  equivalent  for  f€X,a>0: 

(i)  Daf  exists  as  an  element  of  X, 

(ii)  there  exists  a function  g^€X  such  that 

(-l),a,k2V(k)  = g*(k)  (k€N)  , 

(iii)  there  exists  gj€X  such  that  f (x)  = (f^g j ) (x)  + const . (a.e.X 
The  functions  g^  are  uniquely  determined  (a . e . ) apart  from  an 
additive  constant , and  one  has  (Daf ) (x)  *g^(x)  - g?  (0)  (a  .e . ) . 

Setting  :=  {f€X;  Daf  exists  in  X},  one  has  as  immediate 
consequences  of  Thm.  1 that  the  operator  Da:Wy-»-X  is  closed,  and 
the  "fundamental  theorem  for  fractional  Chebyshev  derivatives 
and  integrals": 

Da(Iaf)  = f-f^O)  * la(Daf)  (a.e.)  , 

the  latter  equality  holding  provided  f€Wi.  Similarly  D satis- 

. X 1 
fies  the  additivity  law.  This  shows  that  D f can  be  defined 

equivalently  by  the  limit  in  norm  of  X of  (x,  f-f ) /( 1 -h)  for 

l-i  h 

h-H-,  and  for  r*2,3,...  by  Drf*D  (Dr  f ) . 
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3 Pointwise  Interpretations 

In  order  to  express  the  strong  Chebyshev  derivative  of  or- 
der a>0  in  terms  of  the  usual  pointwise  derivatives,  we  need  to 
define  a conjugate  function  to  f€X,  given  by 

f ~ (x)  = lim  2 rkf~(k)sin(k  arccos  x)  (x€J  — 1 , 1 ] ) , 


r-H- 


whenever  the  limit  exists.  Restricting  the  matter  to  the  sim- 
pler case  X=C(-1,1]  (for  L^(-l,l)  see  [ 2,4])  one  has 


w 


THEOREM  2.  If_  fEW^,  then  one  has  for  each  r£N,  v>a  and  each 


s£P,  2s+)>a  for  x£(-l  ,1) 
<D°£)(x)  - 1 ) r+l  “1  +l  r-“l 


(1-x2) 


dx 


r-a. 


- x — | (I  f)(x), 
dx 


(Daf)(x)=  (-l)t(s»a)v/^?  ^[(l-x2)^-T-x^]8(IS"<1+1/2f))  (x), 


with  t (s  ,a)=s+l+[  a]  +[  s-a+ 1 / 2]  , I°f=f.  The  cases  ot=l , a“l/2  rea£ 

(D,f)(x)  = (l-x2)f"(x)  -xf’(x)  (x€  (-1,1)), 

(D*  ^2f ) (x)  = -Vl-x2'(f~)  ’ (x)  (x£ (-1,1)). 

The  assertions  remain  valid  for  x=±  1 i_f  the  limits  x-*-l - and 
x-*-(-l )+  are  taken  on  the  right  hand  sides . 


4 Lipschitz  Classes  and  Best  Approximation 


Defining  the  Chebyshev  modulus  of  continuity  of  f£X  of  or- 
der a>0  by 

w^(x;f;n)  “ sup  II f II  (n€l  - 1 , 1 ] ) , 

n<h<i  h x 

the  associated  Lipschitz  class  of  order  y>0  is  given  by 
Lip^(X;y)  = (f£X;  </(X;f;n)  - 0((l-ri)Y),  n+1-h 

We  denote  the  best  approximation  of  f£X  by  algebraic  polynomials 

p (x)  of  degree  <n  by  E (X;f) • It  is  well  known  that  there 
n n 

exists  a p* (x) = p* (x;f ) such  that  E (X;f)  ■ II  f-p*ll  „. 

n n n n x 
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Now  to  the  generalization  of  the  fundamental  theorem  on 
best  algebraic  approximation  to  derivatives  and  moduli  of  conti- 
nuity of  fractional  order. 


THEOREM  3 . The  following  assertions  are  equivalent  for 
f€X(=C}  -1,1]  or  Lp)  and  a_<a+$<a  , a,a_>0,  a >0,  CKB<1  : 

— W Z I Z I 


(i)  E (X;f ) = 0(n  2(a+3)) 


(n-*»)  , 


(ii)  (X;  f ;n)  = 0((l~n)a+e)  (n^l  ~) , i.e.  f€Lip^  (X;a+B), 

al  1 


(iii)  D f € 


Lip|(X;B),  0<B<1 


Lip2(X;B),  B=1 , 


(iv)  llDaip*||  = C(n  2(a+S-ai>) 

n a 

(v)  few^2  and  llDa2f-D%*llY  = 0(n2  (a+B_a* > ) 

a “““  n A 


(n-*») , 
(n-*=°)  . 


Let  us  just  compare  our  implication  (iii)  (i)  for 
a=r-l/2,  r£N,  and  X=C[-1,1],  namely  that 

(1)  Dr_l/2f€Lipy(C;B/2)  ■*  E (C;f)  = 0(n'2r+l"B), 

1 n 

with  the  classical  result  of  Timan  [ 8]  , Teljakowskii  [ 7]  , 
Gopengauz  [ 6]  : to  f ^a^£Ct -1  , 1]  , a£N,  there  exists  a sequence  of 
algebraic  polynomials  such  that 

|f(x)-Pn(x)|  - o((i<f7)“a(fw;  ^2)). 

ft(f;6)  sup{  | f (x)-f  (y)  | ; x,y£l-l,l],  |x-y|<6}. 


Neglecting  the  factors 


\/i-x2 


on  the  right,  this  result  states 


in  particular  that  (Lip j (C;B)  * usual  Lipschitz  class) 

(2)  f (2r-l)€LiP] (C;B),  r£N,  0<B<1  -►  En(C;f)  = 0(n"2r+1"6). 

Comparing  the  hypotheses  in  (1)  and  (2),  ours,  equivalent  to 
\Zi-x2  d/dx([  (l-x2)d2/dx2-x  d/dx]  r_lf)~(x)eLipy(C;B/2)  by  Thm.  2, 
is  truly  weaker  than  f'  r ^6Lipj(C;B).  Here  the  multiplicative 


factor  (v4^7)2r  1 of  the  highest  order  der 


ivatxve  compensates 
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the  factors  dropped  above. 

For  other  results  that  have  some  connections  to  particular 
cases  of  ours  see  [ 5] . The  proofs  of  our  results  as  well  as  a 
number  of  others  will  follow  in  [ 1-4]  . 
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ON  THE  COMPUTATION  OF  MINIMAL  PROJECTIONS 


FROM  C[0,1]  TO  Pn [ 0 , 1 ] 

B.  L.  Chalmers  and  F.  T.  Metcalf 

This  paper  discusses  the  computational  implications  of 
several  theoretical  developments  concerning  minimal  projec- 
tions from  C[0,1]  to  Pn[0,l]  (the  polynomials  of  degree 
at  most  n) . Utilizing  these  ideas,  several  cases  have  been 
numerically  treated  for  projections  to  the  quadratics,  and 
the  results  are  given  below. 

1 Introduction 

The  "minimal  projection  problem,"  for  projections  from 
C [0,1]  to  Pn[0,l],  has  generated  a large  body  of  literature, 
very  little  of  which  will  be  cited  here.  This  problem,  though 
simple  to  state,  has  been  settled  only  for  the  case  n = 1, 
or  for  certain  subclasses  of  projections  (see  [1]).  Through- 
out what  follows,  the  projections  will  be  assumed  to  be 
"finite  carrier  projections;"  that  is,  in  the  representation 


p ' WPyv  Li  E c*>  *i  E p„'  Vi  ' V 

the  functionals  L are  carried  on  a finite  number  of  points. 
It  is  known  that  any  projection  is  approximable  by  finite 
carrier  projections  (see  [2]). 

The  determination  of  the  projection  constant  (minimal 
norm)  consists  of  the  min-max-max  problem 

min  max  max  |(Pf)(x)|. 

L.eC*  f eC [0, 1]  xe[0,l] 

1 IML-i 

Computationally,  the  min/max  operations  may  be  viewed  as 

Preceding  page  blank 


J 


1 

i 

i 


< 
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follows,  where  it  is  assumed  that  the  projection  is  carried  on 
m points. 

(I)  max  over  x e [0,1] — Finding  the  extrema  of  an  n^  degree 
polynomial . 

(II)  max  over  f e C[0,1],  1 1 f II  ^ = 1 — Maximizing  over  the  2m 
possible  +1  values  for  f at  the  m carriers. 

(III)  min  over  L ^ e C* — (a)  Minimizing  over  the  m(n+l) 

coefficients  in  the  expansion  of  the  L by  point 
evaluations  (i  = 0,...,n).  (b)  Minimizing  over  the 

choices  for  the  m carriers. 

Problem  (III)  presents  the  major  difficulty  in  the  form  of 
m(n  + 1)  linear  parameters  (the  coefficients)  and  m non- 
linear parameters  (the  carriers).  The  goal  of  what  follows 
is  to  reduce  the  number  of  parameters  in  (III),  and  to  a 
lesser  extent,  the  number  of  choices  in  (II). 

An  immediate  reduction  in  the  number  of  parameters  in 
(Ilia)  may  be  obtained  as  follows.  First,  note  that  P is 
invariant  under  any  invertible  linear  transformation  of  the 

functionals  L = (L  ,...,1  );  i.e., 

o n 

tf1 

P = L ® v = (LA) ® (vA  ) 

for  any  nonsingular  (n  + l)x(n  + 1)  matrix  A.  Thus,  the 

matrix  of  coefficients  of  the  functionals  L may  be  placed 

2 

in  echelon  form,  and  (n  + 1)  of  the  coefficient  parameters 
eliminated.  This  gives,  in  place  of  (Ilia): 

(Ilia')  Minimizing  over  (m  - n - l)(n  + 1)  coefficients 
in  the  expansion  of  the  L ^ by  point  evaluations. 
Also,  the  number  of  carriers  can  be  reduced  by  two,  so 
that  (Illb)  becomes 

(Illb')  Minimizing  over  m - 2 choices  for  the  carriers. 

This  follows  from  the  fact  that  amongst  the  minimal  projec- 
tions on  m carriers,  there  will  be  a projection  having  the 
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end-points  as  carriers. 

2 Symmetry 

DEFINITION.  A projection  P is  symmetric  if 

[Pf](x)  = [Pf(l-.)](1  - x);  f e C [0, 1 ] , x e [0,1]. 

The  following  theorem  serves  to  significantly  reduce  the 
number  of  parameters  in  both  (Ilia')  and  (Illb').  In  seeking 
minimal  projections,  attention  need  be  directed  only  to  those 
projections  having  symmetric  carriers. 

THEOREM.  Amongst  the  minimal  projections  from  C[0,1]  _to 
P [0,1]  there  exists  a symmetric  projection. 

Proof . Let  P be  defined  by 

[Pf](x)  = [Pf(l--)](1  - x);  f e C[0,1],  x e [0,1]. 

Then  (P  + P) / 2 is  a symmetric  projection  such  that 

II  (P  + P)/2||  ±||P||  = ||P||. 

By  restricting  attention  to  symmetric  finite  carrier 
projections,  problems  (Ilia')  and  (Illb')  may  be  replaced  by: 
(Ilia")  Minimizing  over  [ (m  - n - 1) (n  + 1) / 2 ] coefficient 
parameters . 

(Illb")  Minimizing  over  [ (m  - 2)/2]  carrier  parameters. 

(The  bracket  symbol  is  used  here  to  denote  integer  part.) 

3 Expansion  by  interpolators 

It  was  shown  by  Morris  and  Cheney  [2]  that  any  finite 
carrier  projection  P may  be  written  in  the  form 
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P = ZA,?,1,  EA.  = 1, 
k k k 

where  each  is  a projection  carried  on  n + 1 points, 

i.e.,  an  interpolating  projection  (interpolator).  In  this 

representation,  the  linear  parameters  of  (Ilia")  become  the 

coefficients  A,  ; and  the  symmetrization  may  be  accomplished 

-I  I 

by  including  in  the  sum  P^,  for  each  P^,  and  restricting 
the  coefficients  of  these  interpolators  to  be  the  same. 

THEOREM.  Il^k^ll  is  a convex  function  of  A = (A^,...). 

From  a computational  point  of  view,  this  means  of  repre- 
senting projections  possesses  the  following  advantages: 

- for  fixed  carriers,  a convex  function  is  being 
minimized; 

- for  fixed  carriers,  the  biorthogonal  v's  need  not 
be  computed  for  each  change  in  the  A parameters; 

- for  special  cases,  the  number  of  A's  and  the  number 
of  f-values  in  (II)  may  be  significantly  reduced  (see 
the  results  below  for  the  case  of  n = 2) . 

4 Numerical  results  for  the  quadratic  case 

For  symmetric  finite  carrier  projections  from  C[0,1]  to 
?2[0,1],  explicit  numerical  results  have  been  obtained  for 
the  cases  of  4,  5 and  7 carriers.  Two  simplifying  features 
were  found  to  be  available  in  these  cases,  thereby  further 
reducing  the  complexity  of  the  computations.  First,  it  was 
reasoned  that  the  worst  case,  for  the  maximum  over  the  2m 
f-values  of  (II),  should  occur  for  f(x)  = +1  (0  £ x £ 1/2), 
+1(1/2  < x < 1) , — l(x  =1).  In  each  case,  the  final  result 
was  checked  for  all  2m  of  the  original  choices,  and  the 
validity  of  the  preceding  observation  borne  out.  As  an 
example,  in  the  case  m = 7,  the  number  of  choices  was  re- 
duced from  64  (one  +1  value  could  be  taken  to  be  1)  to  8. 
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A second  observation  showed  that  many  of  the  interpola- 
tors would  not  enter  in  a minimal  projection.  This  fact  was 
also  borne  out  numerically  as  well.  Those  interpolators 
which  did  enter  were  much  fewer  in  number;  for  example,  the 
reduction  was  from  10  to  4 interpolators  in  the  case  m = 7. 

In  the  worst  case  considered  (m  = 7),  the  computations 
involved : 

two  variable  carriers 

- three  linear  parameters  (A^,  A2*  A^) 

- eight  choices  for  f = +1 

- finding  the  extrema  of  a quadratic. 

The  small  number  of  parameters  allowed  searches  to  be  carried 
out  rather  rapidly  on  a Hewlett-Packard  9830A  programmable 
desk  calculator. 

Four  Symmetric  Carriers  (see,  also,  [3;  Theorem  14]). 
min  norm  = 1.24518 

carriers:  0,  s,  1-s,  1 where  s = .46127 

Five  Symmetric  Carriers 
min  norm  = 1.22879 

carriers:  0,  s,  1/2,  1-s,  1 where  s = .2462 

A2 

P = A.P„  n . + -*-[P  . + P 1 ] + A,P  . /0  . 

1 0, 1/2,1  2 o,s,l-s  s , 1— s , 1 3 s, 1/2, 1-s 

where  A = .8980,  A£  = .0872,  A3  = .0148 

Lebesgue  function  exhibited  six  extrema. 

Seven  Symmetric  Carriers 
min  norm  = 1.22349 

carriers:  0,  s,  t,  1/2,  1-t,  1-s,  1 where  s = .1366, 

t = .3938 
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P 


^2 

- \ p 4.  — [ p 4.  p 

10, 1/2,1  2 1 s,t,l-s  s,l-t,l-s 

+ 2“[P0,s,l-s  + Ps,l-s,lJ  + 2~[P0,t,l-t 


] 

+ P 

t. 


l-t,l 


] 


where  ^ = .8300,  A2  = .1144,  \3  = .0319,  = .0237. 

Lebesgue  function  exhibited  10  extrema. 

In  these  results,  P denotes  the  interpolator  carried 

rs  l 

on  r,  s,  and  t.  The  indicated  accuracy  reflects  the  extent 
to  which  the  authors  were  able  to  conduct  a semi-automated 
search  using  the  above-mentioned  machine.  The  authors  are 
indebted  to  California  Investment  Counseling  for  providing 
the  opportunity  to  "bootleg"  time  on  the  H-P  9830A. 
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APPROXIMATION  OF  FIXED  POINTS  OF  CONTRACTIVE 


AND  EXPANDING  OPERATORS 
L.  Collatz 

This  survey  discusses  some  complications  which  occur  in 
applying  approximation  methods  to  problems  in  sciences, 
describes  some  numerical  methods  which  may  be  helpful  in  cer- 
tain cases,  and  gives  numerical  results  for  combination  of 
approximation  and  iteration  methods  also  in  the  case  of  ex- 
panding operators. 

1 Approximation  theory  and  applications 

In  applying  the  theory  of  approximation  of  functions  to 
problems  of  physics,  engineering,  economics  and  other  sciences, 
we  observe  the  following  complications: 

1)  One  has  to  use  more  complicated  classes  of  approximating 
functions  than  the  usual  polynomials,  rational  functions  or 
exponential  functions. 

2)  Functions  of  several  independent  variables  occur  and  one 
cannot  use  the  theory  of  Haar  systems. 

3)  Ma.iy  problems  in  applications  have  several  solutions,  often 
only  one  of  which  is  stable. 

4)  One  has  to  apply  chained  approximation.  This  theory  is 
only  beginning  to  be  developed. 

5)  There  are  often  several  functions  to  approximate,  and  one 
must  consider  simultaneous  approximation,  combi-approximation 
and  field  approximation. 

6)  There  occur  more  complicated  restrictions  than  usually 
considered.  Let  us  illustrate  the  situation  with  some  simple 
examples . 
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2 Error  bounds  and  chained  approximation 

The  nonlinear  boundary  value  problem  for  a function 
u(x,y)  using  polar  coordinates  r,  <J> 

(2.1)  Tu  = -Au  - u - eu  = 0 in  B = {(x,y),  r2  = x2  + y2  < 1} 

(2.2)  u = 0 on  3B  for  r = 1 

2 2 2 2 

with  Au  = 3 u/3x  + 3 u/3y  may  describe  the  stationary  dis- 
tribution of  temperature  in  a homogeneous  circular  disc.  We 
look  for  an  approximate  solution  w(x,y)  satisfying  the 
boundary  condition 

p ? 

(2.3)  u=»w  = Z a (1  - r v)  . 

v=l 

Applying  one-sided  Chebychev-Approximation  (Schumaker-Taylor 
[69])  we  wish  to  choose  the  parameters  a such  that 

(2.4)  0 £ Tw  <_  6 in  B,  6 = Min. 

Then  we  choose  other  values  A for  the  a with 

v v 

(2.5)  -5  <_  Tw  < 0 in  B,  6 = Min. 

For  instance  for  p = 2 we  get  the  approximation  problem 

0 < Tw  = a^(3  - r2)  + a2(-l  + 16t2  + r^)-  exp[a^(l  - r2) 

+ - rS]  £ 6 for  r e [0,1],  6 = Min, 

which  is  a chained  approximation  problem  because  the  param- 
eter a^  (and  also  a^)  occurs  at  two  different  places  (for 

a discussion  of  chained  approximation,  see  Hoffmann  [69,  76], 
Collatz  [75]  Collatz-Krabs  [73]).  Even  for  p = 1 
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(2.6)  0 Tw  = a^(3  - r2)  - expfa^d  - r2)  ] < 6 
for  r e [0,1],  6 = Min 


we  have  the  complications  1),  3),  4)  mentioned  in  section  1. 


Results  of  the  Theory.  Fig.  1 gives  the  graphs  of 

T[a^(l  - r2) ] for  a^  =0.5  and  a^  = 0.6191.  The  4 encircled 

points  are  an  H-set  (compare  Collatz  [56] [69],  G.  D.  Taylor 

[72]).  Therefore  we  have  the  exact 
error  bounds  for  the  stable  solu- 
tion 


(2.7)  0.5(1  - r ) <_  u(x,y) 

< 0.6191(1  - r2)  in  B, 

and  we  know  that  these  are  the  best 
possible  error  bounds  one  can  get 
in  the  linear  manifold  w = 


ax(l  - r ) . 


If  one  wishes  to 
get  sharper  error  bounds  one  has 
to  use  other  manifolds,  for  instance  (2.3)  with  p > 1.  Then 
one  can  easily  improve  the  bounds  (2.7). 


3 Chained  approximation  for  nonlinear 
integral  equations 

Chained  approximation  arises  very  often.  We  look  for  an 

clX 

approximate  solution  w(x)  = e for  the  solution  u(x)  of 
the  Volterra-Equation 


A _ 

(3.1)  u(x)  = Tu(x)  = 1 + J eX  [u(t)]2dt 

o 

in  the  interval  J = [0;0.4] . The  Chebychev  approximation 
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problem 

(3.2)  4>(a)  = || Tw  - w|| 


Max 

xeJ 


x(x+2a) 

e - 1 ax  , . 

- e + 1 


x + 2a 


,*(a)  = Min 


is  chained  and  gives  the  value  a = 1.44  with  <J»(a)  «*  0.060. 

4 Subdomains  with  error  bounds 

Let  the  displacement  u(x,y,t)  of  a membrane  be  describe* 
by  the  wave  equation 

2 2 2 

3 u 3 u 9 u 

(4.1)  Lu  = - — - - — r = 0 

9t  9x  9y 

and  the  initial  boundary  conditions 


(4.2)  u = g(x,y)  = (1  - x2)(l  - y2),  = 0 for 


(x,y)  e B = { x < 1,  |y  < 1},  t = 0 


(4.3)  u = 0 for  (x,y)  e 9B,  t > 0. 


We  look  for  an  approximate  solution  w(x,y,t)  of  the  form 

P 

(4.4)  u as  w(x,y,t)  = Z a [cos(b  x)cos(c  y) 

IV  V V 

? ? 1 /2 

+ cos(c  x)cos(b  y)]cos((b  + c ) t) 

V V V V 

which  satisfies  (4.1)  and  9u/9t  = 0 for  t = 0 for  all 

b , c , and  (4.3)  for  suitable  b , c , for  instance 

V V V V 

b^  = c^  = tt/2  or  b^  = tt / 2 , c ^ ” 3n/2.  The  theorem  on 
pointwise  monotonicity  applied  to  a function  n with 
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(4.5)  Ln  = 0 in  B*(0,-),  = 0 in  B,t  = 0, 

O t 

| n | £ <5  in  B,t  = 0 yields  the  bound  | n | £ 6 in 
the  pyramid  P determined  by  the  characteristics  (see  fig.  2): 

(4.6)  P = {(x,y,t),  | x | <_  1,  |y|  £ 1,  t ^ 0, 

Max(  | x | , |y|)<_l-t}. 

We  use  the  Chebychev  approximation 
problem 

(4.7)  ||w(t  = 0)  - gl^  = Inf 

to  determine  the  parameters  a . 

For  p = 2 we  get  the  values 
a1  = 0.5333,  a£  = -0.0394,  and 
the  exact  error  estimate 

(4.8)  |w  - u|  < 0.0122  in  P. 

The  estimate  (4.8)  is  not  assured 
in  the  whole  domain  B x (0,°°)  . This  is  a typical  situation: 
the  approximation  methods  are  numerically  useful  for  getting 
approximate  solutions  and  in  case  of  validity  of  monotonicity 
theorems  they  also  give  error  bounds. 

5 Local  relaxation  and  splines 

A function  y(x)  may  be  defined  in  the  unbounded  inter- 
val J = [0,+°°)  by  the  nonlinear  differential  equation 

(5.1)  Ty=^-x2+— =0  with  lim  y(x)  * 0. 

dx  y X-H-® 

We  consider  rational  Chebychev  approximation 
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(5.2)  y(x)  = w(x)  = 

where  P,Q  are  polynomials  with  3Q  > 3P,  for  instance 


w(x) 


a + x 

, . 2 3 

b + ax  + x 


The  defect  Tw  (see  Fig.  3)  is  small  for  x >_  xq,  and 

can  be  reduced  by  local  relaxation,  changing  w(x)  into  w(x) 

by 


(5.3)  w ( x) 


^(x)  for  x > x 
— o 

w(x)  + o(x  - 6) (x 


x ) for  0 < x < x . 
o — — o 


Fig.  3 shows  the  defect  before 
and  after  the  relaxation  for  the 
values  a = 0.4887,  b = 0.7984, 
xq  = 0.7,  a = -0.0564,  8 = 26.88. 
This  idea  of  local  relaxation, 
which  was  numerically  successful 
also  in  other  problems,  is  re- 
lated to  the  idea  of  spline- 
approximat ion. 


6 Several  solutions  of  a problem 


Let  w(x,y)  be  an  approximate  solution  for  a fixed 
point  u(x)  of  the  Urysohn- equation 


(6.1)  u(x, y) 


Tu 


-1 


+ I 

B 


2[u(s,t)]  dsdt 
2 + x + y + u(s, t) 


where  B is  the  square  0 < x,  y < 1.  We  choose  w in  the 
simplest  way  as  a constant,  x = const.  = a.  This  leads  to 
the  chained  rational  Chebychev  approximation  problem 
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(6.2)  't(a)  * ||Ta  - a||^  = Max|-1  + 

B 


2a  

2 + x + y + a 


- « . 


$(a)  = Min 


Fig.  4 shows  the 
graph  of  <t>(a)  with 
an  absolute  minimum 
a = -0.615  near  the 
stable  solution  and 
a local  minimum 
a = 4.503  near  the 
unstable  solution 
with  expanding  be- 
haviour. 


7 Numerical  procedure  for  expanding  operators 

The  following  iteration  procedure  has  been  tested  nu- 
merically for  constractive  as  well  as  for  expanding  operators 

T.  We  seek  a solution  u of  u = Tu.  Let  v (x . , a. , 

o j 1 

and  v^(Xj ,a^, . . . ,a  ) be  a pair  of  functions  of  x^,...,xr 
with 


’V 


(7.1)  v^(  Xj  >3^9  ••  *9  ) T Vq  ( Xj  • • • 9 ) • 

We  describe  the  iteration  by  induction.  If  we  have  deter- 
mined  in  the  (p  - 1)  step  values  a^P  ^ for  the  (p  - 1) 

S / \ 

parameters  a (s  = l,...,p-l),  then  we  calculate  values  a^p 
s s 

for  the  p parameters  a (s  = l,...,p)  by 

s 


(7.2)  <J> (a  , . . .,a  ) 

1 P 


v Li  4>(a1.**»»a„) 

O"  oo  1 p 


Inf . 


To  get  the  infimum  of  <f>  approximately,  one  can  start  by 
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using  for  the  a (s  = l,...,p-l)  the  values  a F one 
s s 

has  got  in  the  (p  - 1)  step.  Thus,  at  first  one  takes  only 
one  parameter  and  then  enlarges  the  number  of  parameters  with 
every  step. 

8 Expanding  operators  and  nonlinear 
integral  equations 

Krasnoselskii  [64]  stated  a theorem  for  expanding  op- 
erators. Based  on  cones  with  special  properties,  Sprekels 
[75]  used  this  theorem  to  get  error  bounds  for  expanding  fixed 
points  of  integral  equations  of  Hammers tein  type 


(8.1)  u(x)  = Tu  = J K(x, s) f (s, u(s) )ds 

B 

with  positive  kernel  K(x,s).  He  could  get  good  error  bounds 
if  one  has  a good  approximation  of  K(x,s)  as  a product 
p(x)’q(s).  This  leads  to  the  following  approximation  problem: 


(8.2) 


* < 1 
- p(x)q(s) 


< A ’a 


with  A = const,  o = const,  a = Min.  Sprekels  [75]  applied 
his  theory  to  nonlinear  differential  equations.  I thank  him 
for  the  following  example: 


(8.3)  -y"(x)  = A[y(x)]\  0.2y(0)  - y'(0)  = -O.ly(l) 


+ y'(l)  = 0 

He  got  the  following  inclusion,  valid  for  every  A > 0: 
A_1/3  0.4174  9:-+-  * <_  y(x)  < A_1/3  0.5846. 
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9 Secant  method  (Regula  falsi)  and 
Newton-iteration 

Suppose  that  in  the  interval  J = [a,b]  the  real-valued 

function  f(x)  e C^(J)  has  fixed  signs  of  f'(x)  and  f"(x); 

and  that  f(x)  has  a zero  z.  e (a,b)  with  f(z)  = 0.  We 

start  with  two  values  x ,x,  e J 

c 

and  calculate  x 
method: 


o'  1 
with  the  secant 


x1  - x 

(9.1)  x^  = x^  - j — —jr  f^  with 


f j “ f (Xj)  (j  - 0,1) . 

We  suppose  x^  £ J.  Then  one 
gets  lower  or  upper  bounds  for 
z by  choosing  xq,x^  on  the 
same  or  opposite  side  of  z, 
respectively,  as  shown  in  the 
following  table: 


f'(x)  and  f"(x) 
same  sign 


f'(x)  and  f"(x) 
opposite  sign 


x ,x,  on  the  same  size  of  z 
o 1 


x2  > z 


x2  < z 


xq,x^  on  opposite  sides  of  z 


x2  < z 


x2  > z 


W.  Hofmann  [71]  gives  a generalization  to  equations  Tu  =*  6 
in  Banach  spaces  with  6 the  zero-element.  He  is  working 
with  linear  "difference-operators"  T[u',u"]  with  the  prop- 
erty T[u,,u"](u'  - u")  = Tu'  - Tu"  which  are  described  in 
detail  in  his  paper.  Under  certain  conditions  he  can  prove 
inclusion  theorems  similar  to  the  theorems  given  in  the  table. 
If  T is  Fr6chet-dif f erentiable,  one  gets  similar  inclusion 
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theorems  also  for  the  Newton  procedure.  Good  starting  ele- 
ments x0>xi  are  essential;  one  gets  these  elements  with  the 

aid  of  Approximation  Theory.  I thank  Mr.  Hofmann  for  the 

2 2 

numerical  example:  For  Tu  = -Au  + 2u  = 0 in  B:(r 

2 2 

x + y < 1)  and  u = 1 on  3B:(r  = 1)  monotonicity  and 

Schauder's  Fixed  Point  Theorem  (compare  Collatz  [66])  give 

2 

the  bounds  (easy  to  get)  1 - (1/2)  (1  -r)£u£l- 
(1/4) (1  - r2)  with  1 0 . 6 2 5 - u(0,0) | <0.125  (error  20%). 

The  method  of  Hofmann  improves  this  estimate  to  1 0 . 70594  - 
u(0, 0)  | £ 0.00484  (error  0.68%). 

The  idea  of  using  monotonicity  properties  was  also  used 
by  Voss  [76],  but  in  a different  way  as  by  Hofmann.  Voss  uses 
a decomposition  of  the  operator  T in  the  given  equation 
Tu  = 6;  he  writes  T = T^  - T^  and  supposes  T^  is  an  op- 
erator with  the  same  properties  as  the  "difference  operator" 
of  Hofmann  mentioned  above,  while  T^  is  a monotone  operator. 
To  illustrate  the  decomposition  we  consider  the  following 
initial-value  problem  for  which  I thank  Dr.  Voss: 

y"  - y2  + 2y  = 2x2  = 0,  y(0)  = 1,  y'(0)  = 0 

Here  a possible  decomposition  is 

T^y  = (y"  - y2»y(0) ,y ' (0))  and  T2y  = (-2y  + 2x2,  1,0) 

and  Ty  = T^y  - T2y  maps  the  solution  into  the  zero  element 
of  the  space  C2 [0,1/2]  * * R^. 

Taking  quadratic  polynomials,  one  gets  the  error  bounds 
1 - (1/ 2) x2  < y (x)  < 1. 

10  Inverse  problems  and  not  well  posed  problems 

We  consider  the  case  that  in  the  heat-conduction  equa- 
tion (or  equation  for  diffusion,  concentration  a.o.) 
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i 


(10-1>  Lu  ■ ft  - h [k<x)  f]  -° 

the  unknown  coefficient  k(x)  should  be  determined  from  the 
observations  of  the  temperature  u(x,t)  of  a beam  at  two 
different  times.  As  a simple  case  suppose 


(10.2)  u = 0 for  x=+l,  0 < t < 1 (at  the  ends  of  the  beam 


u = 1 - x for  I x | < 1,  t = 0 


u = -^(1  - x^)  for  | x | <1,  t = 1. 


o 

U 

3 


u ■ 


B 


We  take  as  approximate  terms 


o 

n 

a 


(10.3)  u w(x,t)  = (1  - x ) 


-1 


f-xfc  * * 


t1- 


2 p 
t + (t  - t )•  Z 


a tVl 

0 V J 


fij.6 


(10.4)  k(x)  = Z b x 
u=0  ' 


2y 


The  coefficients  a , b^  are  calculated  from  the  Chebychev 
Approximation 


(10.5)  ||Lw)|  = Inf. 


with  the  maximum  norm  in  the  domain  B : | x | <_  1,  0 <_  t <_  1 
(see  Fig.  6).  One  gets  the  values 
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3 Parameters 

5 Parameters 

a = -0.1716 
o 

a = -0.19240 
o 

a = 0.03986 

~ S = 0.3431 
o 

b = -0.1122 

= 0.34662 

o 

b = -0.11571 
b2  = 0.00001 

IIHL  - °-5147 

||Lw||  = 0.00088 

The  function  Lw  has  5 extrema  in  B+  = B H {x  >_  0};  the 
minimum  of  the  modulus  of  these  5 extrema  is  0.00083.  Fig.  7 
shows  the  graph  of  k(x)  for  q = 2,  the  domain  B with 
the  dotted  line  Lw  = 0,  and  the  extremal  points  of  Lw.  It 
is  remarkable  that  for  k(x)  one  gets  nearly  the  same  result 

for  q = 2 as  for  q = 1, 
(difference  =«  1%) . 

Many  other  not  well  posed  problems 
are  well  known;  we  mention  only 
the  following  two  problems  coming 
from  applications:  1.)  For  the 

wave  equation  with  given  coef- 
ficient k(x) 

(10.6)  LU  - - fe(k00£)  - 0 

we  consider  the  observed  displace- 
ment u(x,  t)  for  |x|  <_  1,  t = 0 
For  x * +1,  0 < t < 1 we  sup- 
pose u = 0,  and  we  want  to  calculate  u(x,t)  in  the  interior 
of  the  rectangle  B. 

2.)  In  the  equation  for  diffusion  u(x,t) 

(10.7)  Lu  - - 0 , 

3xz 


and  t = 1 (as  in  Fig.  6) 
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. x , t > 0,  if  u ts 

calculate  u(x,t)  for  ° 0 and  W'<>x 

fo,  x - 0,  t > 0 ana  0 11  ° we  have  to 

glven  for  „ Q t > 0 (ate  Fig.  8)  ■ ,Mtoxl«atlon. 

is  given  tor  * ’ apply  eimnltaneous  ^ [76). 

compare  Bredendiek 

it  is  easy  to  enlarge 
0£  course,  „hich 

ehe  number  ot  examples 
2 numerical  methods  ot  approxi 

mation  theory  are  useful. 

_ auth0r  thanks  Mrs.  Susanne 

d Mr  Rolf  Wildhack 
BBttger  and  • calculation 

very  much  for  numerica 
on  a computer. 
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Z.  Ditzian 

For  an  approximation  process  A f a saturation  theorem 
determines  the  optimal  rate  of  convergence.  This  rate,  being 
exceeded  only  for  a limited,  "trivial",  class  of  functions, 
is  achieved  on  a "large"  class  of  functions.  Saturation 
classes  and  theorems  were  introduced  by  Favard  and  investi- 
gated by  many  authors  for  various  operators,  but  as  far  as 
the  author  is  aware,  the  "trivial"  class  was  always  a finite 
dimensional  space  at  most.  The  main  difference  here  is  that 
we  will  obtain  an  infinite  dimensional  but  non-dense  subspace 
for  our  trivial  class. 


We  shall  deal  with  convolution  approximation  operators 
defined  as  follows: 


(1.1)  A f(x)  = 
n 


f(x-t)  dp  (t) 
n 


where  t = (t^,***,^),  x = (x^.-'-.x^)  and 


(1.2) 


dp 

(t) 

= 1 and 

Rk 

some 

n 

2 

sequence  o^ 

lim 

Q 

1 

to 

’ 

t .t . 

n -*■  <» 

n J 

k 1 j 

lim 

Q 

1 

to 

R 

r 

t . dp 

n -►  * 

n j 

U 1 

ij 


il 


and 


(1.5)  lim  a 


-2 


n -*•  00 


dp  (t)  = 0 6 > 0 B (0, 6)={ t ; t <6 

n 


R -B(0, 6) 
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One  can  check  that  many  convolution  approximation 


operators  satisfy  the  above  conditions.  For  instance: 

k 

(a)  A f (x)  = y 
n n 


exp{-n  l t } f (x-t)dt ; 
,k  i=l 


(b)  A f (x)  = y (r) 
n n 


R 


k i=l 


sin (n  t ±/2) 1 


sin(t^/2) 


2r . 


f (x-t)dt , 


r ^ integer,  r^  _>  2; 


(c)  Anf(x)  - [V(b(0  i))]'1' 


f (x-t)dt ; 


or 


(d)  A f (x)  = y 
n n 


E(0,i) 


f (x-t)dt» 


where 


E(0,i)  - It:  (t1+t2,2  + + i t2  +--±t 


1 .2 
k 


2 . The  local  small  o saturation  result 

The  key  to  our  investigation  is  the  result  of  this  sec- 
tion. (However  this  is  not  the  most  difficult  result.) 

THEOREM  2.1.  For  A f satisfying  (1.1),  (1.2),  (1.3),  (1.4) 

and  (1.5),  we  have,  for  E(B^)  being  L^B^)  l j<  p < <»  or_ 

C(B  ) (where  B = B(x  ,r)), 

r r o 2 

<a)  l|Anf-f||E(B  )*  0<\>  iSEiiss.  P(D)f  - l Aij  ■ 

r j,i  i j 

in  B and  f e C in  B ; 

— r — r 

(b)  P (D)  f = 0 in  B implies  ||  A f-f  ||  . . = o(o2). 

r n lvd  f y n 

r-6 

OO 

Proof . To  prove  (a),  we  observe  that  for  d>  € (^(B^) 
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(supp  <J>  c B^)  we  have 

lim  <o  ^ (A  f — f ) , t)>>  = 0, 
v n n 

n ■>  00 

* 

since  4>  e L^  l<^p<<»  and  4>  e L^  and  therefore 

<t>dx  = u(A)  is  a measure  and  in  the  dual  of  C.  This 
A 

implies 

<a  ^ (A  f-f),<}>>  = <f,o  ^ (A  = 

n n n n 


<f  ,P  ( — D)  4>  > + o (1)  = <f,P(D)<}>>  + o(l)  = o(l) 

or  P(D)f  = 0 weakly  as  a distribution  in  B^.  Since  P(D) 

is  elliptic  (as  can  easily  be  shown),  Apriory  estimates 

00 

[1,  p.66]  can  be  used  to  show  that  f e C (B^) , and  there- 
fore P(D)f  = 0 not  only  in  the  distributional  sense. 

The  proof  of  (b)  is  mere  computation  if  we  know  that 
P(D)f  =0  in  ECB^)  implies  f e C (B^) . This  can  be 
obtained  by  using  some  Apriory  estimates. 

3.  The  local  big  0 theorem 


THEOREM  3.1.  Let  A^f  be;  defined  as  in  Theorem  2.1,  then 
for  1 < p < 00  we  have 

(a)  HAnf-f  HL  (B  )=  °(°n)  iBEMes  f e W 2 (B  ) ; 
p r 

O’)  f‘H2,p(Br>  ilEiiSi  (B 

r p r-6 


(where  W„  is  the  Sobolev  space  of  functions  with  2 
^ > P 

derivatives  in  L ). 

p 

THEOREM  3.2.  Let  A^f  be^  defined  as  in  Theorem  2.1,  then 
for  E = C (or  E = L^) 

(a)  llAnf_fllE(B  )=  0 (°^)  implies  P(D)f  e L^B^) 
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r 

(or  I P(D)f  dx  = p(e)  ij>  £ measure  of  bounded  var Iat Ion  on 
e 


B £)  where  P(D)f  is  taken  in  the  distributional  sense; 

r— o 

(b)  f e L„  (B  ) implies  || A f-f||0/„  \ = 0 (a2); 

2,°°  r — n C (B  ) n 

r-6 

f e L,  , (B  ) and  the  derivative  of 
i , i r 


II A f-f 


f ij3  a measure  of  bounded  variation  implying 


* / tj  v = 0(a) 

L1  Br-6 


It  is  not  possible  to  give  a complete  proof  in  the  small 
space  allotted  to  this  paper.  However  we  note  that  the  main 
difference  between  3.1  and  3.2  is  a result  of  the  developed 
a priori  estimate  for  L , 1 < p < 00  (see  Nirenberg  (6]  and 

Calderon  Zygmund  [3]),  which  does  not  apply  to  L nor  L . 

X 00 

Other  parts  of  the  proof  follow  in  part  the  author's  result 
for  convolution  operators  [4]  as  well  as  those  of  Kozima 
and  Sunouchi  [5]  and  Butzer  Nessel  [2]. 


4 . Global  results 

Many  of  the  above  results  have  global  analogues.  For 
instance,  in  the  case  of  the  Gauss-Weierstrass  transform: 


* 

(4.1)  Gn*f  = exp f-n(x-y)2) f (y)dy  x,y  c Rk, 

it  was  proved  that  ||Gn*f-f||  = of—)  if  and  only  if  f = 0. 

2 P 2 

It  is  clear  that  P(D)  = Z 9 /3x^  and  indeed  the  only  har- 
monic function  that  belongs  to  L is  f = 0.  However 

- 2 1 P 
||  e X (G  *f-f)||  = of—)  , n > 2,  implies  f is  harmonic  but 

n „ " E vn'  — 

ii  | x | 2 

| f | = o(e  1 ) x -*  °°.  For  other  sequences  of  operators 
different  weight  functions  may  be  needed. 

5.  Remarks 


The  results  are  applicable  to  convolution  operators  on 
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■n  where  the  functions  are  periodic.  A non-convolution 
multidimensional  sequence  of  positive  operators  leads  some- 
times (as  in  the  case  of  Bernstein  or  Baskakov  operators)  to 
a solution  of  elliptic  equations  with  non-constant 
coefficients . 
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P.  ErdOs 

The  subject  has  very  many  aspects  and  ramifications  and 
I am  not  at  all  competent  to  deal  with  many  of  them  and  in 
fact  in  this  short  article  I do  not  even  have  the  space.  Thus 
I will  eventually  concentrate  on  problems  which  my  collabora- 
tors and  I investigated.  I will  try  to  review  them  as  systemati- 
cally as  possible  and  will  also  state  a few  new  problems. 

Herzog,  Piranian  and  I in  a paper  [7]  (we  will  refer  to 
this  paper  as  (I))  investigated  several  geometric  extremal 
problems  for  polynomials;  we  stated  there  many  conjectures 
some  of  which  were  settled  (positively  or  negatively)  by 
Pommerenke  and  others.  First  I give  a short  review  of  the 
fate  of  the  conjecture  stated  in  (I).  Pommerenke' s two  papers 
in  which  he  deals  with  the  problems  in  (I)  are  Pommerenke 
[12]  and  [14];  we  will  refer  to  them  as  (PI)  and  (PII), 
respectively.  Some  of  these  problems  are  discussed  by 
Pommerenke  also  in  [13]. 

1.  Let  -1  < x,  <_...£.  x <.1.  We  conjectured  and  Elbert 

— 1 n J 

[4]  proved  that  the  measure  of  the  set  of  real  numbers  x 

for  which  | IT1?  (x  - x ) | <.  1 is  not  greater  than  2^2. 

“i  2 1 

The  polynomial  1 - x shows  that  this  is  best  possible.  The 
proof  of  Elbert  is  surprisingly  complicated  and  a simpler  one 
would  be  desirable. 

2.  Put  f (z)  = JJ1?  ,(z  - z . ) . Denote  by  E (f)  the  set  of 

n 1=1  1 n 

points  where  | f ^ ( z) ] <.  1.  P61ya  proved  that  the  area  A^(E) 
of  E^(f)  Is  — 1T*  with  equality  only  for  zn  and  he  also 
proved  that  the  projection  of  E^(f)  on  every  line  has 

Preceding  page  blank 
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measure  < 4.  Denote  e = minA  (E)  where  the  minimum  is  to 

n n 

be  taken  over  all  f^(z)  where  roots  are  all  in  |z|  <_  1. 

Using  a deep  theorem  of  G.  MacLane  we  proved  -*■  0.  We 

conjectured  that  > n for  every  n > 0 and 

n > n^Cn),  we  also  conjectured  that  one  can  place  a circle 

of  radius  Cn  ^ is  A (E) . Both  these  conjectures  are  still 

n 

open,  but  Pommerenke  proved  some  weaker  results.  More  gen- 
erally in  (I)  we  conjectured  that  if  T is  any  set  of  trans- 

finite  diameter  1 then  to  every  e > 0,  for  n > n (e) 

o 

there  is  a polynomial  f (z)  = n”  , (z  - z ) with  z e T 

n i=l  i i 

so  that  A (E)  < e.  This  is  easy  if  T is  (-2, +2)  but  the 
n 

general  case  is  open. 

Netaniahu  and  I [8]  proved  that  there  is  a constant  p 

c 

depending  on  c alone  so  that  if  T is  a connected  set 

of  transfinite  diameter  1-c  then  E (f)  contains  a circle  of 

n 

radius  p^  . Our  proof  is  a pure  existence  proof  and  does 

not  give  an  explicit  bound  for  p . It  would  be  interesting 

c 

to  determine  (or  estimate)  the  best  possible  value  of  p^ 

and  also  the  best  possible  value  of  A^(E).  The  following 

special  case  might  be  of  some  interest.  Let  the  roots  of 

f (z)  be  in  I z I £.  1-c.  Then  E (f)  contains  the 

R n ^ 

circle  |z|  <_  c and  thus  has  area  i ire  . I wonder  if  this  is 

best  possible:  Is  it  true  that  to  every  e > 0 there  is  an 

f^(z)  with  | z ^ | <_  1-c  and  A^(E)  K uc  + e ? 

We  proved  in  (I)  that  there  is  a polynomial 

f^Cz),  |z^|  1,  for  which  E^(f)  has  n - 1 components  and 

that  it  never  can  have  n components.  As  far  as  we  know 

the  following  question  has  not  been  investigated:  Let  T 

be  a set  of  transfinite  diameter  1.  Is  it  true  that  for 

each  n there  is  a polynomial  f^(z)  all  whose  roots  are  in 

T and  for  which  E(f)  has  >.  n - Yn  components  where  y^ 

is  "small"  (certainly  o(n)  and  perhaps  bounded;  if  T is 

(-2, +2),  then  y = 0)?  If  T has  transfinite  diameter  1-c 
n 
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then  in  general  y > f(c)n,  but  as  far  as  we  know  also  this 
n 

question  has  not  been  investigated. 


3.  A pretty  theorem  of  H.  Cartan  states  that  for  a polynomial 

fn(z)  = n"(z  - z^),  i = l,...,n,  the  lemniscate 

E (f)  can  always  be  covered  by  circles  the  sum  of  whose  radii 
n 

is  less  than  2e.  We  conjectured  in  (I)  that  the  correct  value 

here  should  be  2 (clearly  best  possible  if  true).  We  also 

conjectured  that  if  E (f)  is  connected  then  it  is  contained 

n 1 

in  a circle  of  radius  2 and  center  — Z.  ,z..  This  conjecture 

n i=l  l 

was  proved  by  Pommerenke  (PI). 

Two  conjectures  of  (I)  which  seem  to  me  the  most  attrac- 
tive are  as  follows:  The  sum  of  the  diameters  of  the  components 

of  En(f)  is  n2^n  (equality  for  zn  - 1)  and  the  length 

of  the  curve  If  (z) I =1  is  maximal  for  f (z)  = zn  - 1 . 
n n 

As  far  as  we  know  no  real  progress  has  been  made  with  these 
conjectures  though  Pommerenke  (PII)  proved  that  the  length 

of  |f  (z) | =1  is  less  than  74n  — whereas  the  "truth"  should 

1 n ' 

be  2n  + 0(1) . 

In  (I)  we  made  the  ill  fated  conjecture  that  the  number 

of  components  of  E^ff)  which  have  a diameter  > 1 + c is 

less  than  6 , <5  bounded.  Pommerenke  (PII)  showed  that 
c c 

nothing  could  be  farther  from  the  truth,  in  fact  he  showed 

that  to  every  e and  k there  is  an  E^tf)  which  has  more 

than  k components  of  diameter  > 4 - e.  Our  conjecture  can 

probably  be  saved  as  follows:  Denote  by  <j>  (c)  the  largest 

n 

number  of  components  of  diameter  > 1 + c which  E^(f) 
can  have.  Surely  for  every  c > 0^  $n(c)  = o(n)  and  hope- 
fully ^(c)  = o(nc).  I have  no  guess  about  a lower  bound 
for  ^(c)  also  I am  not  sure  whether  the  growth  of  ^(c), 

1 < c < 4,  depends  on  c very  much. 

In  (I)  we  conjectured  that  if  E (f)  is  connected  its 

n 

circumference  is  >_  2tt,  equality  only  for  zn.  In  (PI) 
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Pommeneke  proved  this.  As  far  as  I know  the  following  ques- 


tion has  not  yet  been  investigated.  Assume  that  E^(f)  is 
convex;  how  large  can  its  circumference  be?  Perhaps  8 is 
the  correct  bound. 

Assume  that  E (f)  is  connected,  let  d be  its  diameter, 
n 

let  b be  its  width.  We  conjectured  that  b <.  2 but  in 

(P I I ) Pommerenke  showed  that  b > 2.386  is  possible,  he 

further  showed  that  b < 2.920  and  that  to  every  E^Cf)  there 

is  a direction  so  that  the  proj ection  of  E (f)  on  this  direction 

n 

has  measure  < 3.30. 

We  conjectured  that  if  En(f)  is  connected  then  to 

every  z^  on  the  boundary  of  E^(f)  there  is  a z e E^Cf) 

with  z - z I > 2.  Pommerenke  in  (PII)  disproves  this  and 
o 

shows  that  2 can  be  replaced  by  (3/4)/3  which  is  best 
possible . 

Let 


z^  - Zj  [ <_  2,  1 <_  i < j <_  n.  We  conjectured  that 


n 


Z,  - z. 


assumes  its  maximum  if  the  z^  are  the 


l^icjln1  i j 
vertices  of  a regular  polygon  of  diameter  1.  Danzer  and 

Pommerenke  [3]  disproved  this  conjecture  for  even  n but  it 

probably  holds  for  odd  n.  As  far  as  I know  it  is  open  for 

n >.  5. 

Assume  that  E^Cf)  has  n components.  Is  it  true  that 

its  area  is  maximal  if  f (z)  = zn  - 1?  If  true  this 

n 

probably  will  not  be  easy  to  prove.  On  the  other  hand  the 
following  conjecture  (If  true)  would  be  probably  not  difficult 


to  establish.  Let 


path  joining  the  origin  to  the  circle 


= 1,  1 <.  i <_  n.  Then  there  is  a 

1/n 


= 2 


on  which 


Cl(z 


z£)  | <.  1. 


4.  Many  of  these  problems  can  be  extended  to  higher  di- 
mensional spaces  or  metric  spaces.  Let  us  here  restrict 
ourselves  to  three  dimensional  Euclidean  space  with  distance 
d(z,z').  If  z1t...,z  are  n points  in  space,  denote  by 
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E (z.)  the  set  of  z's  for  which  IT?  ,d(z,z.)  < 1.  Is  it 
n i i-1  i — 

true  that  if  E (z.)  is  connected  it  is  in  the  interior  of 

n 1 In 

the  sphere  of  radius  2 and  center  — Z,  , z.?  What  is  the 

2 n i=l  l 

maximum  of  the  volume  of  E (zj  ? I first  thought  that  the 

n i 

maximum  will  be  attained  for  the  unit  sphere  , i.e.,  in 

case  when  all  the  points  z^  coincide,  but  Piranian  showed 

that  this  is  false  already  for  n = 2.  It  would  be  very 

interesting  to  determine  the  maximal  volume  of  E (z.)  and 

n i 

the  distribution  of  the  z^  which  achieves  this  maximum. 

Many  of  these  problems  are  of  great  geometric  interest. 

In  the  plane  the  regular  polygon  usually  gives  the  solution, 

but  nothing  corresponds  to  this  in  three  dimensions.  Here 

is  a typical  example:  A well  known  theorem  of  Pdlya  states 

that  if  |z.|  <_  1 then  In  (z.  - z.)|  is  maximal 

i l<jt<j<n  l j 1 

if  the  zi  are  the  vertices  of  the  regular  n-gon.  Assume 

now  that  the  z^  are  in  the  unit  sphere;  I can  not  even  guess 

for  which  set  of  n points  z^,  1 <_  i <.  n is  the  maximum 

of  d(z,z^)  | assumed.  Similarly  I can  not  guess  for 

which  set  z.  is  the  area  of  fl,  . . d(z.,z.)  = 1 maximal’ 
i l<i<jln  i j 

(the  Zj  are  unrestricted  here). 


5.  Let  f^Cz)  = n”_i(z  - z ^ ) and  assume  that  E (f) 

i i 2 

connected.  I coniectured  that  max  „ f'(z)  <n  /2 

2 z£En(f)  n 2 

Pommerenke  proved  this  with  en  /2  instead  of  n /2  . 


is 

Let 


a ^ be  the  distance  of  z.  to  the  boundaries  of  the  lemniscate . 
Perhaps  — n(2^n  - 1);  equality  for  z°  - 1. 

> c could  remain  true  if  instead  of  connectedness 
we  assume  | z ^ | £_  1 . 

As  far  as  I remember  we  never  considered  the  following 

questions  which  are  perhaps  not  uninteresting.  Let  E (f) 

n 

be  connected,  when  is  its  area  minimal?  Probably  when  the 
roots  are  all  real  and  max|f(z)|  = 1 between  any  two  con- 


secutive roots.  A related  question:  Let  connected.  ; 
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When  is  (f)l^n^z^l  (area  integral)  maximal  and  minimal? 

The  maximumnis  probably  achieved  for  zn  and  the  minimum  when 
the  roots  are  all  real  and  between  any  two  roots  |f^(z)| 
assumes  the  value  1. 

In  some  cases  connectedness  of  E (f)  can  perhaps  be 

n 

replaced  by  the  following  condition:  No  line  separates  the 

components  of  E (f).  For  example:  Is  it  true  that  E (f) 

n ^ n 

is  contained  in  a circle  of  radius  2 and  center  — E,  , z. 

n i=l  i 

under  this  assumption?  For  this  section  compare  [9]. 

6.  I conjectured  35  years  ago  that  if  f (0)  is  a trig- 

n 

onometric  polynomial  whose  maximum  is  1 and  all  whose 
roots  are  real  then 


2 7T 

/ | f (0)  | <.  A. 

o n 


Let  the  rational  polynomial  f^(x)  have  all  its  roots 


in  (-1,+1),  max 


f (x) I = 1 and  let  x.,  x. 


- 1<x<1 ' n 

two  consecutive  roots  of  f (x) . Then 

n 


i+1 


be 


I 1+1|fn(x)|  < dn(x.+1  - x.) 


where 


T»<*>  * 


is  the  Tchebicheff  polynomial  and 
consecutive  roots  of  T . 


y i > yi+1  are  two 


n 

These  conjectures  and  more  have  all  been  proved  recently 

by  Kristiansen  [11]  and  Saff  and  Shell-Small  [15]. 

I proved  [5]  that  the  arc  length  from  0 to  2tt  of  a 

trigonometric  polynomial  f of  degree  n satisfying 

If  (0)|  <1  is  maximal  for  cos  n0.  Let  0 < a < b < 2ir, 
n ' 
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is  it  still  true  that  the  variation  and  arc-length  in  (a,b) 
is  maximal  for  cos(n0  + a)  for  a suitable  a?  Also  if 
|f^(x) | i 1,  -1  < x < 1 is  a rational  polynomial  of  degree 
n,  is  it  true  that  T^(x)  has  the  greatest  arc-length?  The 
answer  clearly  must  be  yes — only  a proof  is  needed. 

7.  I would  like  to  call  attention  to  two  old  problems  of 

mine  which  perhaps  belong  more  to  number  theory.  Let 

z , n = 1,  2,...,|z  I =1  be  an  infinite  sequence.  Put 
n n 

n 

A = max  II  1 z - z . | . 

" |z|-  1 i-1 

Prove  (or  disprove)  lim  A^=  °°.  This  problem  is  probably 
difficult.  Let  be  the  least  upper  bound  of  the  numbers 

m k 

Z z . , m — 1.2..... 
i-1  1 


It  is  easy  to  see  that  a sequence  {z^ },  j = 1,2,...  exist 

for  which  B < Ck.  I conjecture  lim(B  /k)  > 0.  Clunie  [2] 

K 1/2  k 

proved  lim(B^/k  ) > 0. 

These  two  problems  really  belong  to  a chapter  called 
irregularities  of  distribution  of  diaphantine  analysis, a 
subject  to  which  K.  F.  Roth  contributed  many  deep  results. 


8.  Finally  I state  a few  miscellaneous  problems  on  poly- 


nomials. First  an  old  problem  of  mine:  Let 


-1  x1  < 


< x <_  1.  Let  H (x  ) = 1,  £ (x.)  = 0 for  k / i be  the 
n K.  K.  iC  1 

fundamental  functions  of  the  Lagrange  interpolation.  Prove 

(or  disprove)  (x~  =-l,  x ,,  = 1)  that 
U n+.L 


n 

(1)  min  max  Z | £,  (x)  | < c 

0<±<n+l  x_<x<xi+i  k=l 

I proved  the  much  weaker  result  with 


log  n. 


n1/2 


instead 


of 


c log  n.  Perhaps  I overlooked  a simple  approach  but  I 
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never  got  anywhere  with  (1) . 

The  following  questions  can  not  be  difficult  and  the 

answers  are  perhaps  known:  Let  z^,...,z  be  n points  on 

the  unit  circle.  Let  P (z)  = II?  , (z  - z.),  let  y.  be  the 

n i=i  1 i 

point  on  the  arc  (z^,z^+^)  where  | ( z ) | assumes  its  maxi- 


mum.  Is 

it  true  that 

n 

n 

(2)  n 

P (y.)|  < 2n  and  Z |P  (y.)  1 > 2n? 

'n  i — , , 1 n i 1 — 

i=l 

i=l 

There  is 

equality  in  (2)  for  zn  - 1. 

Let 

|w. | =1,  1 i j i n + 1,  Pn^z)  = ni=i(z 

1 z±  1 = !■ 

But 

A (w. , . . . ,w  , , ) = jnin  max  |P  (w  .)  I. 
n 1 n+1  pn  1 <j  <n+l  n j 


n+1 


Is  it  true  that  A (w. , . . . ,w  ) is  maximal  if  w.  = 1? 

n i n+1  j 

i.e.,  if  the  wj 's  are  the  (n+l)-st  roots  of  unity.  Determine 
the  extreme  value.  This  surely  must  be  simple  but  at  the 


moment  I do  not  know  the  answer. 

Let  |zi|  <_  1,  1 < i < n,  fn(z)  = ni=i^z  “ 


Put 


A(f  ) = max  max  f (z) i min  f (z)  . 
n n ,ii  n ii  1 n 

0<r<l  | z | =r  | z | =r 

How  large  is  max  A(f  )? 

f n 

n 

Some  of  these  questions  may  not  be  "serious"  Mathematics 
but  I am  sure  the  following  final  problem  considered  by  D.  J. 
Newman  and  myself  for  a long  time  is  both  difficult  and  in- 
teresting: Let  = + 1.  Is  it  true  that  there  is  an  abso- 
lute constant  c so  that  for  every  choice  of  the 

max  | £ e,  z^|  > (1  + c)n^^? 

| z | =1  k=l 
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This  probably  remains  true  if  the  condition  = +1  is  re- 
placed by  |ejJ  = 1.  For  this  section,  see  [1],  [6], 
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SOME  NONLINEAR  VARIATIONAL  PROBLEMS 


Stephen  D.  Fisher 

We  derive  a necessary  condition  that  a solution  of  the 
variational  problem  inf{j]Tu||:u  e U}  must  satisfy  and  show 
how  this  condition  can  be  used  to  solve  certain  nonlinear 
equations  and  to  derive  detailed  properties  of  the  solution 
of  the  minimum  curvature  problem. 

1 Introduction  and  main  results 

Let  X and  Y be  real  Banach  spaces,  let  U be  a subset 
of  X and  let  T be  a continuous  map  of  U into  Y.  We 
shall  investigate  the  existence  and  nature  of  solutions  of  che 
variational  problem 


(1)  a = in f { 1 1 Tu ] j :u  e U} 


Of  particular  interest  is  the  case  when  Y is  L or 

some  closed  subspace  hereof  and  for  this  reason  we  shall  in 

the  following  restrict  ourselves  to  the  situation  when  X and 

Y are  dual  spaces  of  separable  Banach  spaces  W and  Z, 

respectively.  We  shall  also  assume  that  we  are  given  weak-* 

continuous  Frechet  differentiable  functionals  £,...,  I on 

° m 


X and  real  numbers  r 


m 


and  that 


(2)  U = {x  e X: (x)  — r j » J = 0, . . . ,m}  . 

If  any  is  linear  then  the  equality  SL^  (x)  = r^  may  be 

used  in  the  definition  of  U without  any  change  in  the  con- 
clusions. The  following  is  the  simplest  existence  theorem. 

Preceding  page  blank 
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THEOREM  1.  _If_  T is  weak-*continuous  and  if  there  is  a 
bounded  minimizing  sequence  for  (1),  then  there  is  a solution 
Tuq  of  (1). 

In  order  to  obtain  information  about  the  solution  of  (1) 
we  assume  that  T is  Frechet  differentiable  at  u ; that  is, 
there  is  a bounded  linear  operator  L from  X into  Y with 

(3)  T(uq  + 6x)  = Tuq  + 6Lx  + 0(62),  6 + 0 

whenever  u + 6x  lies  in  the  domain  of  T for  all  small 
o 

enough  6. 

THEOREM  2.  Let  uq  e U be  a solution  of  (1),  let  J consist 

of  those  indices  for  which  1 . (u  ) = r.,  let  i\  be  the 
^ J t)  j J 

Frechet  derivative  of  l,  at  u and  set 
3 o 

V = {x  e X:£.j(v)  <_  0 for  all  j e J}, 

Vq  = {x  e X:Jtj(v)  = 0 for  all  j e j}. 

Let  L be  the  Frechet  derivative  of  T at^  uq  and  suppose 
that  L is  weak-*  continuous  and  that  LV  has  finite 

o 

codimension  in  Y.  Let 


*V  = {m  € Z:m(Lv)  >_  0 for  all  v e V}. 


Then 


(4)  a = inf{||Tuo  + Lv|)  :v  e V} 
and 


(5)  a = max{  |m(TuQ)  |:||m||  il,  m e *V}. 
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The  proof  of  Theorem  2 is  obtained  by  using  (3)  to  es- 
tablish (4)  and  then  using  duality  to  obtain  (5)  making  crucial 

use  of  the  fact  that  LV  has  finite  codimension  in  Y to  in- 

o 

sure  that  the  equality  in  (5)  is  actually  attained  for  some 
m t *V.  Note  that  if  some  is  linear,  then  the  equality 

(x)  = 0 may  be  used  in  the  definition  of  V. 

COROLLARY  3.  If  Y = L°°(fi,y)  then  there  is  an  h e L1(fi,y) 
with  norm  1 and 

(i)  0 < JhLv,  v e V 

(ii)  0 <_  hTu  a .e  .y 

o 

(iii)  |Tu  I = a a.e.y  where  h ^ 0. 

o 

Proof.  (i)  follows  from  Theorem  2 while  (ii)  and  (iii)  are 
consequences  of  equality  in  Holder's  inequality. 

COROLLARY  4.  The  number  a is  the  distance  from  Tu  to  LV  ; in 
o — o — 

particular,  if  there  are  no  restraints,  then  a is  the  dis- 

tance  from  Tu  to  LX. 

o — 

2 Applications  to  nonlinear  equations 

EXAMPLE  1.  Let  F(t,x),  t e [0,1],  x t lRn,  be  C^  and  con- 
sider the  initial-value  problem 

(6)  y^(t)  = F(t,y(t) , . . . ,y^n-1^  (t)) 

y^  (0)  - r , j = 0, . . . ,n-l . 

Set  X = Wn’°°(0,l) , Ty(t)  = y(n)(t)  - F(t,y(t) , . . . ,y (n-1)  (t) ) , 

My)  = y(J)(0)  for  j = 0 n-1.  Then  V = ^’"(0,1)  and 

J o 

the  Frechet  derivative  of  T at  u is 
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Lg(t)  = g(n)(t)  + A (t)g(j)(t) 
j=o  3 

where  A are  continuous  functions  depending  on  F 

o n-i 

oo 

and  u . Hence,  L maps  V onto  Y = L (0,1).  Therefore, 
o o 

in  order  to  conclude  that  (6)  has  a solution  we  just  need  to 
impose  conditions  on  F(t,x)  which  assure  that  (1)  has  a 
solution.  One  such  condition  is  that 

|F(t,x)|  < (1  - 6) |x|  + M,  some  6 > 0,  M > 0. 

Another  is 

|F(t,x)|  <.  C|xn|  + M,  some  C,M  > 0. 

EXAMPLE  2.  Let  g be  a smooth  monotone  increasing  function 
on  ft  with  |g(x)|/|x|  ->-0  as  x -*■  . Let  f e L (0,1) 

and  consider  the  nonlinear  boundary-value  problem 

(7)  (-l)Ku(2K)(t)  + g(u(t))=  f(t) 

u(v)(0)  = a , u(v)(l)  = b , v = 0, . . . , K-l. 

Here  we  put  X = W2K’°°(0,1),  Y = L°°(0,1),  £yu  = u(v)(0) 

for  v = 0,...,K-1  and  fc^u  = u^V  ^^(1)  for  v = K, ...,2K-1. 

Let 

(Tu) (t ) = (-l)Ku(2K)(t)  + g(u(t))  - f(t). 

Then  the  Frechet  derivative  of  T at  u is 

o 

(Lv)(t)  = (-l)Kv(2K)(t)  + g'(uo(t))v(t) 

OO 

which  maps  onto  L showing  that  (7)  has  a solution; 
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the  growth  condition  on  g guarantees  that  the  minimization 
problem  has  a solution. 

EXAMPLE  3.  Let  L be  a given  number  and  g a function  in 
o 

LP(0,L  ),  1 < p <_  00 . Then  there  is  a curve,  parametrized  by 
o 

arc-length,  with  length  Lq  and  curvature  g.  That  is,  there 
is  a pair  of  functions  x(t),  y(t)  e W2,P(0,Lq)  with 

(a)  i2(t)  + y2 (t)  = 1 

(b)  x(t)y(t)  - X(t)y(t)  = g(t),  0 <.  t <.  Lq. 

This  is  proved  by  letting  U consist  of  all  pairs  (x,y) 
in  X = W2,P(0,Lq)  (b  W2,P(0,Lq)  which  satisfy  (a)  and  set- 
ting T(x,y)  = g - <(x,y)  where  <(x,y)  is  the  curvature, 
given  in  this  case  by  the  left-hand  side  of  (b). 

3 Minimum  curvature 

As  a last  application  of  Theorem  2 and  Corollary  3 we 
investigate  the  following  problem:  Let  P = {p^,..., p^ } 

be  a set  of  distinct  points  in  the  plane.  We  wish  to  pass  a 

smooth  curve  t -*  (x(t),y(t))  through  the  set  P whose 

00 

curvature,  measured  in  the  L norm,  is  as  small  as  possible. 

We  impose  the  constraint  that  the  lengths  of  the  competing 

curves  have  a uniform  bound  L for  otherwise  the  Infimum  of 

o 

the  curvatures  is  zero  and.  except  in  the  trivial  case  when 
all  the  points  lie  on  a single  straight  line,  there  is  no 
smooth  curve  with  zero  curavture  passing  through  all  the 
points.  When  the  arc-length  parametrization  is  used  the 
curvature  formula 

T (x, y)  (t ) = (x(t)y(*-''  - x(t)y(t) ) (x(t)2  + y(t)2)  ^2 
simplifies  to 
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t2  = x(t)2  + y(t)2 

and  hence  every  minimizing  sequence  is  bounded  in 

2 oo  _ 2 °° 

X = W ’ (0, L ) ® VT’  (0, L ).  Let  (x,y)  be  a solution 
00 

parametrized  by  arc-length  and  let  points  0 <.  t^  <.  . . . <. 
t^  be  chosen  with  (x(tj),y(t.))  = p^,  j = 1,  . . . ,N. 

We  let  £j(u,v)  = (u(t . ) ,v(t^ ) ) for  j = 1 N and 

0 , s r^o ,.2  .2,1/2 

L (u,v)  = I (u  + v ) 

0 0 

so  that  the  nonlinear  constraint  is  % (u,v)  < L . The 

o — o 

Frechet  derivative  of  T at  (x,y)  is  given  by 

L(u,v)  = -vX  + Vx  + uy  - iiy  - 3(xu  + yv)< 

where  k is  the  curvature  of  (x,y).  It  is  not  hard  to  see 
that  L maps  X onto  L°°;  thus,  according  to  Theorem  2 and 
Corollary  3 there  is  a function  h z with  0 <_  JhL(u,v) 
whenever  u(t^)  = v(t^)  = 0 for  j = 1,...,N  and 
0 >.  J!/(u,v);  this  h also  satisfies  <h  0 a.e.,  and 

|k|  = a a.e.  where  h ^ 0.  Some  integration  by  parts  shows 
that  h satisfies  the  differential  equation 

(9)  ii  + K2h  = AK 

where  X is  a scalar,  X £ 0 ; if  A < 0,  then  whenever  h = 0,  we 
must  have  k = 0 also.  In  general,  if  h is,  say,  positive 
on  (a,b)  with  h(a)  = h(b)  = 0 then  (9)  shows  that 
b - a >_  tt/u.  Thus  the  intervals  on  which  h has  one  sign 
have,  with  at  most  2 exceptions,  length  at  least  u/a.  If 
A = 0,  then  it  is  possible  that  h vanishes  on  some  interval 
(a,b)  and  yet  k does  not.  If  so  we  may  just  repeat  the 
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minimization  process  on  (a,b)  using  as  admissible  functions 
those  pairs  (£>n)  in  U which  agree  with  (x,y)  outside  the 
interval  (a,b).  In  this  way  we  obtain  a solution  consisting 
of  finitely  many  arcs  of  circles  and  straight  line  segments. 

S.  D.  Fisher 

Northwestern  University  and  Mathematics  Research  Center, 
University  of  Wisconsin.  Research  supported  in  part  by  the 
National  Science  Foundation  and  by  U.S.  Army  Contract  No. 
DAAG29-75-C-0024 
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THE  PROBLEM  OF  MINIMAL  PROJECTIONS  IN  -SPACES 
C.  Franchetti  and  E.  W.  Cheney 


This  note  presents  an  outline  of  the  theory  of  minimal 
projections  onto  finite-dimensional  subspaces  of  i^(T,  ,u). 

We  define  a projection  to  mean  a bounded  linear  map  P:  X -* Y 
of  a normed  space  X onto  a subspace  Y having  the  property 
that  Py  = y for  all  y e Y.  Such  a projection  is  minimal  if 
jjp||  < ||Q|!  for  every  projection  Q from  X onto  Y.  A con- 
crete determination  of  the  minimal  projection  of  j^[-l,l]  onto 
the  polynomials  of  degree  1 is  given  as  an  illustration  of  the 
theory . 


If  P is  a projection  of  a normed  space  X onto  a finite- 
dimensional subspace  Y,  then  for  any  basis  (y^,...,ynj  of  Y 

there  exist  functionals  f,,...,f  in  X'c  such  that  P = 

1 n 

n n 

Z f.  x, y..  This  notation  means  Px = Z f.(x)y..  If  X - 
i=l  i ^ i i=l  1 1 

/^(T,Z, m)  and  if  (T,Z,  p)  is  a cr-finite  measure  space,  then 
X*  can  be  identified  with  / (T,  .,  p)  and  the  projection  takes 

CO 

n 

the  form  Px  = /L  (x,u.)y.  where  u.  el  and  (x,u)  = 
i=  1 i i l 00 

/xft)u(t)  dp(t)  . The  projection  property  is  equivalent  to  the 
requirement  that  (u^.y^)  = 8 (l<i,j<n). 

For  the  projection  P = Zu  x, y . we  define  the  kernel  to 
n 

be  K(s,t)  = Z u.(t)y.(s).  The  projection  has  the  form  of  an 
i=l  1 1 

integral  operator  with  kernel  K:  (Px) (s ) = /K(s , t )x(t )dt . Next 
we  define  the  Lebesgue  function  of  P to  be 


Ap( t)  = / I K(s , t ) | ds . 

One  can  prove  that  A depends  only  on  P and  not  on  its  par- 
ticular representation.  Furthermore,  the  norm  of  P as  an 
operator  on  /j  is  given  by 
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P([  = ess  sup  Ap(t). 

The  minimal  projection  problem  for  the  subspace  having  ba- 
sis > • • • >yn}  can  now  stated  as  follows.  Among  the  n- 

tuples  (Uj , . . . >un)  satisfying  u.  e and  (u^ ,y^  ) = 6 , find 

one  for  which  the  expression  ess^sup  / |Lu^  (t )y^ (s ) | ds  is  a 
minimum. 

The  first  important  characteristic  of  a minimal  projection 
in  this  setting  is  that  its  Lebesgue  function  must  be  constant. 
The  precise  theorem  is  as  follows. 


THEOREM  I.  Let  (T,Z,  p)  be  a finite  and  nonatomic  measure  space. 
Let  P be  a minimal  projection  of  jf  (T,Z,  m)  onto  a smooth 
finite-dimensional  subspace.  Then  A^  is  constant. 

Here  the  word  "smooth"  is  used  as  in  Banach  space  theory: 
each  point  on  the  unit  sphere  has  a unique  supporting  hyperplare. 
This  is  equivalent  to  the  requirement  that  no  non-zero  function 
in  the  subspace  can  vanish  on  a set  of  positive  measure. 

Another  necessary  condition  satisfied  by  a minimal  projec- 
tion is  as  follows. 


THEOREM  11.  Assume  the  same  hypotheses  as  in  Theorem  I,  and 
n 

let  P = A u.  x.y.  be  a minimal  projection.  Then  there  do 

i=1  x c J 

not  exist  v,  , . . . ,v  e Y-^-  D £ satisfying 

In  oo  — — * “ 

n 

ess  sup  Z v.(t)f.(t)  < 0. 
t i=l  1 1 

where  f ^ ( t ) = /y^Csjsgn  K(s,t)ds. 

THEOREM  III.  The  two  necessary  conditions  given  in  Theorems  I 
and  II  taken  together  form  a sufficient  condition  for  the  mini- 
mality of  a projection. 

In  order  to  give  an  idea  of  the  techniques  used  in  proving 
these  theorems,  we  shall  outline  here  the  proof  of  Theorem  I. 
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Suppose  that  P is  a projection  whose  Lebesgue  function  A is  not 
constant.  Then  there  exists  a positive  number  6 such  that  the 
two  sets 

A = (t  I A(t)  > ||p||  - 6} 

B = (t  : A(t ) < ||p||  -5} 

have  positive  measure.  In  a separate  lemma  we  establish  that 

there  exist  disjoint  measurable  sets  B, , . . . , B in  B and  a basis 
J I n 

{y  , ...,y  } for  Y such  that  / y = 5 . This  requires  that 

in  B^  j ij 

the  measure  space  be  nonatomic.  Next,  select  u in  £ 

1 n oo 

such  that  P = Z u^  (jc)y  . We  need  now  the  fact  that  A is  inde- 
pendent of  the  representation  of  P,  so  that  A(t ) = ||  Zu^  (t  )y^  ||^  . 
Now  we  perturb  the  functions  as  follows. 

f ui(t)  + if  t e B 

Vt> - I 

(1-e  )ui(t)  if  t e C, 

Here  C = T \ (B  U ...  U B ) and  e = e / u.y  . Because  of  the 

^ J £ 1 J 

special  form  of  the  perturbed  u^,  one  can  show  that  the  opera- 
tor  Q£  = Z u^  (5c) y i is  a projection  onto  Y and  that  |(Q  ||  < ||p|| 
for  some  e. 

As  an  example  of  the  theory,  we  now  outline  a determina- 
tion of  the  minimal  projection  from  ^ [-1,1]  onto  the 

space  of  ist-degree  polynomials.  This  problem  was  posed  to  us 
by  Professor  Carl  de  Boor.  One  begins  by  calculating  the  / - 

norm  of  an  arbitrary  element  of  7T  . Namely,  |a+bt  |dt  = 

2 2 i -I 

2 | a | or  (a  +b  )/  |b  | according  to  whether  |a  | > |b  | or  |a  | < |b  | . 

Next  we  fix  a basis  for  the  subspace:  y^(t)  = 1 and  y2(t)  = t. 

It  is  possible  then  to  prove  that  among  the  minimal  projections 
2 

P = Z u . (x^  y there  is  one  for  which  u1  is  even  and  u is  odd. 

i=l  1 1 1 1 

We  next  show  that  if  such  a projection  has  the  additional  pro- 
perties 
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o < |u1|  < |u2  I 
u2  U1  2XtU].U2 


U1  + U2  = p'u2  ' 

for  appropriate  constants  X and  p,  then  it  must  be  minimal. 
The  two  functions  which  have  all  these  properties  are  given  on 
the  interval  [0,1]  by  the  formulas 


u2(t)  = 


-X 


2 log  | 


f,  , , 2 2 

1 + X t 


-xt  v(V  + ! 


') 


u 1 ( t ) = u2  ( t ) 


X 


rn  7 

t + VX  t +1 


in  which  X = (1  - P ) / ( 2^)  and  {■  is  the  root  of 


2 2 

2£(1  ” I f |)log  | + 1 = | between  0 and  1.  The  numerical 


values  are  F = .32796779,  \ = 1.36055608,  (|p1|  = -7,/log  | = 

1.22040491.  The  minimal  projection  of  ^ onto  is  unique. 
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Let  w(x)  = exp{-Q(x)}  where  Q(x)  is  an  even  convex 
function  defined  on  (-00,00)  . We  define  the  sequence  {q  } 
by  the  relation 


q Q'(q_  - 0)  < n < q Q'(q  +0)  (n  = 1,2,...) 
n n — — n n 


and  we  set  q^  = q^ . 

Then  under  suitable  additional  conditions  on  Q we  have 


THEOREM  1 . For  an  arbitrary  polynomial  of  degree  net 

greater  than  n we  have  for  every  1 < n < » 


(*) 


T < 

1 n1 1 p — 


A — 


n"  p 


Here  ||'||p  is  the  norm  of  The  number  A depends 

on  the  choice  of  w but  does  not  depend  on  n or  x . 

C — n 

In  particular,  the  following  conditions  (i),  (ii) , (iii) 
together  are  sufficient  for  (*)  to  hold: 

(i)  Q e C2(0,»), 

(ii)  Q"(x)  is  positive  and  nondecreasing  in  (0,°°)  and 
(±ii)  X Q^-C0  (°  < X * °°)* 

Note  that  (i),  (ii)  and  (iii)  are  satisfied  if  a =1  2 and 
Q(x)  = lx[a,  i.e.,  w (x)  = exp{-|x|a}. 

We  apply  (*)  to  prove  precise  Bernstein-type  converse 
theorems  in  weighted  polynomial  approximation. 
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For  more  special  weights  and  using  simpler  arguments  we 
prove  similar  results  in  [2]  and  [3]. 

1 Proof  of  the  inequality  (*) 

The  case  p = 00  of  (*)  will  be  proved  separately  in  our 

paper  [6].  The  only  comment  we  make  here  is  that  far  deeper 

arguments  are  needed  than  those  which  applied  to  prove  some 

special  cases  of  it  in  our  papers  [1]  and  [4].  In  the  course 

of  extending  (*)  to  1 < p < <»  we  will  need  the  following  re- 

~ — 2 

seults  proved  elsewhere:  We  denote  by  {s  (w  ;f;x)  } the  par- 

v 

tial  sums  of  the  expansion  of  a function  f satisfying 

2 

fw  e Ip  in  the  orthonormal  polynomial  system  {p  (w  ;x)} 
and  we  introduce  the  delated  de  la  Vallde  Poussin  means 

(1.1)  v (w2;f;x)  =—  I s (w2;f;x). 
n n n+1  v 

Then  by  [6]  Lemma  4.1  resp.  [3]  Theorem  4 . 1 we  have 


(1.2) 


v (w  : f ; • ) w < c,  fw 
n °o  — 1 


and  in  virtue  of  [6]  Theorem  3.1 

(1.3)  ||v'(w2;f;.)w||  < c9  — ||fw||  • 

11  n 11  co  — 2 q 00 

^n 

Let  us  denote  by  the  set  of  polynomials  which  have 

degrees  net  greater  than  n.  Then  we  have 

(1.4)  v (w2; t ) = t , (x  e P ) 

n n n n n 

and  for  arbitrary  f and  every  x e P we  have 

n n 

(1.5)  J[f  - v (w2; f ) ] x w2  dx  = 0. 

n n 


i 


i 
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In  order  to  prove  (*)  for  p = 1 we  observe  that 

(1.6)  IIt'wIL  = sup  / t'  g w2  dx 
n 1 n 

where  g runs  through  all  functions  satisfying  1 1 gwj | ^ 1. 

By  (1.5)  and  a subsequent  integration  by  parts 


(1.7)  f t * gw  dx  = /t'v  (w  ;g)w  dx  = -ft  v'(w  ;g)w2dx 
Jn  Jnn  ■'nn 


+ 2 (t  Q'v  (w  ;g)w  dx 
J n n 


so  that 


(1.8)  |JVgw2dx|  i||Tnw||1[||v^(w2;g;*)w||so  + 2||Q  'vn(w2;g;  • )w)|j  . 


The  first  term  in  brackets  can  be  estimated  by  (1.3).  As  to 
the  second  term,  we  have  in  virtue  of  [5]  Lemma  4.2  and  (1.2), 
(1.3), 


(1.9)  llQ ’ vn(w2; g;  * ) wfl^  <J|Q'  [vn(w2;g;  .)  - v^w2; g;  0)  ]w||( 


+ c3  lvn(w2; g;0)w(0) | 
+ c3||vn(w2;g;.)w||oo  < 
We  infer  from  (1.6)  that 


1 c4llVn(w58’*HI( 


c,  *-||gw||  < c,  ^ 

5 q 11  11  oo  — 5 q 


ll^ll  i (c2  + 


\ 11 
5>  Z 


We  know  that  (*)  is  valid  for  p = 1 and  p = <».  The 

general  case  1 £.  P 00  can  be  obtained  from  these  two 
particular  cases  by  applying  the  Riesz-Thorin  interpolation 
theorem.  (Same  argument  as  in  [1]  and  [4]). 
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2 Remarks  on  direct  theorems 

of  weighted  polynomial  approximation 

In  our  paper  [3]  we  introduce  generalized  (.^-continuity 
moduli  of  functions  with  respect  to  weights  as  follows.  For  a 
4>  satisfying  i>w  e L we  set 

(2.1)  w(L  , w;  6)  = sup  ||T  (4>w)  - <tw)|  + 6||Q'$w)| 

P |h|<6  h "P  6 UP 

where  is  the  translation  operator,  i.e.,  [ f (x) ] = f (x + h) 

and 

(2.2)  Q^(x)  = min[6_1, |Q’(x) |]. 

Note  that 

(2.3)  0 £ 6Qj(x)  ^ 1 (6  >_  0)  . 

This  expression  characterizes  the  order  of  optimal  weighted 
polynomial  approximation  in  the  same  way  as  the  usual  contin- 
uit>  modulus  of  2rr-periodic  functions  determines  the  order  of 
trigonometric  approximation.  In  fact  let 

(2*4)  e^(w;f)  = inf  ||(f  - t)w||  i 

teP 

n 

then,  as  we  proved  in  [3] 

(p)  c7r/<i  \ t \ q 

(2.5)  tn+r(w;*)  < c6e  (^jr  m(Lp,w;$  r ,^) . 

This  is  the  analogue  of  Jackson's  trigonometric  approximation 
theorem.  In  the  last  two  chapters  of  [31  wc  deduced  (2.5) 
from  the  two  assumptions  that  (1.2)  is  valid  and  that 
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q co 

(2.6)  e^(w;<i>)  < c — / w(x) |d$(x)  | . 

n — on 

— CO 

Let  us  now  observe  that  (1.2)  certainly  holds  if  (i),  (ii) 
and  (iii)  are  satisfied  and  that  we  proved  in  [5]  that  (2.6) 
is  true  if  (i)  is  valid  and  we  have 


(2.7) 


-i  < lim  Q'  (2x) 
x-*°°  Q ' (x) 


< lim 
x-x*> 


Q'(2x) 

Q’(x) 


I 

«=. 


In  what  follows  we  replace  (2.7)  by  the  slightly  more 

I 

restrictive  condition  j 


(2.8,  1 < c,  < x < c10. 


Thus  we  assume  that  (i),  (ii)  and  (2.8)  are  satisfied.  This 
is  certainly  true  if  w(x)  = exp{-|x|a),  a > 2. 


We  observed  in  [5]  that  we  can  replace  w(L  ,w;$ 

qn  p 

by  the  smaller  factor  U(L  ,w:$,  — ) where 

p n 


(r) 


(2.9)  ft(L  ,w;f,6)  = inf  (L  ,w;f  - a, 6) 

P aeR  P 

i.e.,  ^(lp,o),f,6)  = sup  || T^ ( f w)  - f w|| p + 6 inf||Q^(f  - a)w|| . 

In  fact,  e^P^(w;$)  does  not  change  if  we  replace  $ by 
$ - (1/r .)axr  . Consequently  we  can  replace  on  right  side  of 
(2.5)  m(4>^r^)  by  infm(<I|^r^-a) . We  can  give  a more  accurate 

picture  on  weighted  polynomial  approximation  in  terms  of  W 
than  in  terms  of  w.  In  particular,  we  introduced  in  [5]  the 
Peetre  type  K functional 


(2.10)  K(Lp,w;$,6)  = inf  (U^wll  + «||f  Jw||p> 

fl+f2=<t 

and  proved  that 
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(2.11)  c K(L  ,2;$,6)  < ft(i  ,w;<J>,6)  < c K (L  ,w;4>,6). 

11  p — p — 1Z  p 

As  a last  remark  to  this  chapter,  let  us  observe  that  we  have 

(2.12)  a(L  ,w;f  +f  6)  < n(L  .wjr.fi)  + n(L  ,w;f_,5). 

P1Z  — pi  p Z 

3 The  converse  theorems  of  weighted 
polynomial  approximation 

On  combining  (*)  with  (2.11)  and  (2.10)  we  get  for  arbi- 
trary r e P 
n n 

(3.1)  J)(L  ,w;t  ,6)  < c K(L  ,w;x  ,6)  < c196||t'w|] 
p n — 12  p n — 12  11  n Np 

< Ac  <5  — ||  t wj|  (t  e P ) . 

— 12  q " n "p  n n 

'n 

Let  $w  e L and  (x^  e P^}  be  a polynomial  sequence  for 
which 


(3.2)  ||  (*  - Tk)w||p  < 2c£p)(w;$). 


In  view  of  (2.12)  we  have  then 

n 

Z 

k=l 


(3.3)  fi(L  ,w;t  ,6)  < Z Q(L  ,w;t  , - t . . ;6) 

P 2n-l  ~ k-1  P 2k-l  2k_1-l 


n 2^ 

<_  Ac  2*5  £ q \ IKt  k T k-i_ 

k=l  q2  2-1  2 -1 

- Ac126  z ~ T k )W^P 

k-1  V 2-1  P 


+ ‘ T k-1  HU 

2*  -1  P 

^ /n\ 

-2AC126  z Tk  [e  k (w;*> 
k-1  q2  2-1 
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+ E<?> 


e In  (w>.*)] 

2 -1 

X y lc  / \ 

lc136  z 7T  e k (w;*> 

L k=0  q2K  2-1 

It  follows  by  (2.10),  (2.11)  and  (2.12) 


(3.4)  Q(L  ,w ;$,6)  < Q(L  ,w;$  - t ,6)  + (2(L  ,w;t  ,6) 
P - P 2n-!  P 2n-l 


HL  + 0(1, w;t  n 6) 


'12  2n-l  "P  P'  ' 2n-l 


- 2C12  e<'n)  (w’*>  + C136  Z l~ke(k)  (w;<t) 

2n-l  J k=0  q2  2-1 


THEOREM  3.1  We  have 


q2n 


2n  n-1 


(3.5)  fi(L  ,w;*,  ^-)  < c.,  E e(P}  (w;$). 

P 2n  - 14  2n  k=Q  2k_x 


Proof . Combining  the  identity 


P1 ' -»hq2n 


*2n 


Q’(x) 


with  our  assumption  (2.8)  we  obtain  that 


1 < c15  < f±<  c < 2 ; 
n 


consequently 


2 2^+Z  2^  2 

(3.7)  2 > — > /- — > — > 1. 

C15  q2a+l  q2k  C16 

It  follows 


(3.8)  e(P}  (w; $)  < 


2n-l 


4 n n-1  k . . 

^ c17  E e ^ (w;<f) 

17  2n  k=0  q2k  2n-l 
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\n  n-1  k , . 

< C — r — £(P)  („;»). 

“ 17  2”  k-0  q2k  2k-l 

Inserting  <5  = 2 nq  in  (3.4)  and  applying  (3.8)  we  get 

2 

(3.5),  q .e  .d . 

THEOREM  3.2  Let  $w  e L and  for  a natural  r let 

p 

00 

(3.9)  Z vr_1  q“r  e(p)  (w;4»)  < ». 
v=l 

C r ^ 

Then  4>  has  an  r-th  derivative  4>  satisfying 
*(r)w  e L and 


(3.10)  (w,$(r))  < clge 


ciQr  00  9m  r t \ 

E0  ^ 

m=0  „m  2 n 

2 n 


Proof . Observe  that  (3.9)  is  equivalent  to  the  convergence 
of  the  series  on  the  right  of  (3.10).  By  (3.2)  and  (*)  we 
have,  considering  also  (3.7), 


9m+l  m "p  — 18  \ q / 0m 

2 n 2 n \ n m ' 2 n 

2 n 


Summation  over  m gives  the  estimate  (3.10)  for 
II  - Tnr))wllp’  9-e.d. 

THEOREM  3 . 3 If  for  some  nonnegative  interger  r and  some 
0 < p < 1 there  exists  a constant  B > 0 for  which 

(3.11)  e(p)(w;$)  < B(— )r+p 


then  has  an  r-th  derivative  satisfying  w e L and 

p 


(3.12)  0(Lp,w;<!>^r  ,6)  <_  C(w,r  ,p)B6P  . 
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Note  that  if  we  replace  in  (3.12)  ft  by  w then  the  factor 
C would  depend  on  <t. 
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A SEMI-INFINITE  LINEAR  PROGRAMMING  PROCEDURE  AND  APPLICATIONS 


TO  APPROXIMATION  PROBLEMS  IN  OPTIMAL  CONTROL 
K.-H.  Hoffmann  and  A.  Klostermair 

We  generalize  the  well  known  Dual  Simplex  Method  to  a con- 
vergent exchange  algorithm  for  semiinfinite  linear  programs  in 
Rn.  Neither  for  practicability  nor  convergence  proof  do  we  need 
some  kind  of  Haar's  Condition.  The  procedure  differs  from  the 
Carasso-Laurent-Topf er  [1]  and  the  Schafer-Algorithm  [4]  in 
milder  assumptions  for  convergence  and,  because  we  do  not  need  to 
solve  auxiliary  problems  at  single  iteration  steps  it  seems  to 
converge  faster.  Often  approximation  problems  can  be  reformula- 
ted as  semiinfinite  programs  in  R . Therefore  the  procedure 
seems  appropriate  for  solving  approximation  problems  under  side 
conditions . 


1 Semiinfinite  convex  programs 

Let  T ,T,  be  disjoint  index  sets  and  T :=  T U T. 
o 1 o 1 

For  1 6 T we  consider  fixed  elements 

bte*,  yte*n> 

and  define  the  hyperplane 

Ht  :=  { x £ Rn | < yt,  x > = b } 

and  the  halfspace 

HSt  :=  {xeRn|  < y ^ , x > >_  b }, 

where  < ♦,*  > means  the  innner  product  in  Rn.  The  set  of 
feasible  points  for  the  programming  problem  is  defined  by 

M :=  ( r\  H ) r\  ( r\  HS  ). 

1 6 T t e T. 

o 1 

Let  u £jn  be  a fixed  target  vector.  We  consider  the 
following  semiinfinite  linear  programming  problem 

Preceding  page  blank 
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(P)  Find  £GM  and  a_GR  such  that 

< u,?c  > = inf  {<  u,x  > [ xGM)  =:  £ . 

This  is  a linear  program  with  a finite  number  of  variables  but 
an  infinite  number  of  side  conditions.  Approximation  problems 
can  be  reformulated  in  problems  of  this  type;  e.  g.: 
for  a given  function  f 6 C [a,bj  find  an  element 

6 V :=  span  (v,,...,v  ,)  C C [a,b]  such  that 

I n~  1 

Avev  || f - vll^  < || f - v 

is  equivalent  to  the  semiinfinite  linear  program: 

find  cofficients  (a  , a , ...,  a )fRn  such  that  a is  mini- 

1 2 n n 

ma 1 and 

n- 1 

+ (f  (t)  - Z a . v.  (t) ) < a 
— ..ii  — n 

i=  1 

for  all  t G [ a,b] . 

Corresponding  to  (P)  we  define  its  dual  program.  Let 

W :=  {w  = (wt)teT  wteK,  wt  f 0 for  t G T'  C T,  I T * j < <*>}, 

The  set  of  dual  feasible  points  is  defined  by 

M : = (w  G W w >0  for  tGT,,  Z wy  = u}. 

- * - 1 tei  f t 

(D)  Find  wGM  and  a€R  such  that 

Z w b = sup  {Z  w b I wGM*}  =:  a. 

L f t t 

tGT  t6T  C 

This  linear  programm  has  a finite  number  of  side  conditions  but 
an  infinite  number  of  variables. 

For  a geometric  interpretation  we  introduce  the  notation 

R-  :m  {(yt,bt)eRn  ' jt€T.  },  i=0,  1 (Restriction  set) 

and  R :=  R(  U R^ . Then  x is  a feasible  point  for  (P)  iff 

R()  C H(x,-I)  :=  {(y,b)&Rn  x R < y,x  > - b = 0}  and 

R , C HS(x,-l)  :=  {(y,b)£Rn  x r!<  y,x  > - b > 0}  . 
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Let  G :=  {u}  x ECRn  *.  Then  (P)  is  equivalent  to 
(PG) : Find  a hyperplane  H(x,-1)  such  that 

with  a minimal. 
wt  >_  0 for  teTj  } , 

then  (D)  is  equivalent  to 

(DG)  : Find  a point  P :=  (y  ^ ' \ . . . ,y  ^ , a)  6 G /'NC  with  a 
maximal . 

2 Relations  between  problem  (P)  and  (D) 

The  exchange  algorithm  presented  in  this  paper  solves 
problem  (D) . We  state  relations  between  (P)  and  (D)  in  the 
following  strong  duality  theorem. 

THEOREM  1 . Let  u satisfy  the  condition 
(V)  u is  relative  interior  point  of  C , 

Co  :=  {yeRn  |Vb6R  (y,b)eC  }. 

Then:  a_  = a (strong  duality  property)  and  in  the  case  M # 0 

a minimal  point  x of_  (P)  exists . 

The  main  idea  in  the  proof  is  a separation  argument  for  suita- 
ble defined  convex  sets  in  Rn+'.  Condition  (V)  seems  to  be 
very  strong  but  in  the  case  M ^ 0 and  span  {y ^ I t£T}  = Rn 
it  is  equivalent  to  M :=  {jcfcM  | < u,x  > = a.  } ^ 0 and  boun- 
ded . 

For  the  construction  of  the  algorithm  and  as  a stopping 
condition  on  the  iteration  we  need  the  following  characteri- 
zation theorem. 

THEOREM  2.  Let  (P)  and  (D)  have  the  strong  duality  proper- 
ty and  let  x be  a feasible  point  for  (P) . 


RoC  H(x,  i ) , R CHS(x,-D  and 
H(x,-1)  O G = { (u(l ) , . . . ,u(n) , a)} 

Define 

C :=  { l w (^t)  waR  for  t£T, 
_ , ™ t b_  t 
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Let  T*:  = {t  , t^  } be  a finite  subset  of  T and 

w*,  w*. . . . , w*eR  with  w*  > 0 for  t.^Tp  We  define: 


wf  for  t = t.  (1  < i < k) , 

i l — — 


Let 


w 


for 


k 

E 


* *■ 

w . y . = 

l J l 


= u . 


Then  the  following  statements  are  equivalent: 

(i)  x solves  (P)  and  w = (wL)ce^  solves  (D) . 

(ii)  < u,x  > = E w b 

tfe  T 1 t 

(iii)  < y^.jc  > = b^  for  i=l,2,...,k. 

We  intend  to  construct  a maximal  sequence  of  feasible 
points  for  (D) . Similar  to  finite  programs  it  is  sufficient  to 
consider  a subset  of  M consisting  of  basic  solutions. 


DEFINITION  3.  Let  w = (wt'>teT^u  and 
T(w)  :=  {tfeT  | w(t)  4 0}. 

(i)  w£W  is  a basic  solution  <=>  ^ t6T (w)  ^ linearly 

independent . 

A f/t- 

(ii)  w £■  W is  a feasible  basic  solution  <=>  w£  M , w is  a 
basic  solution. 


THEOREM  4.  _If  (D)  has  a solution,  there  exists  a basic  solu- 
tion solving  ( D ) . 


DEFINITION  5.  w = (wt>t  6W  is  degenerate 

<=>  span  (y  | t£T(w)  U T } ^ R°. 
t o 

3 The  algorithm 

• — s 

In  the  step  s we  have  a basic  solution  w and  according 

— g # 

to  Definition  5 an  index  set  T(w  ) C.  T with  (y  ) ,-s. 

t t fe  i (w  ; 
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linearly  independent.  In  contrast  to  the  Carasso-Laurent-Topfer 
and  the  Schafer  Algorithm, at  each  step  we  work  with  an  index 

s'  = „ 


TS  D T(wS)  with 


(yt)t  ^.s  linearly  independent  and  |T" 


For  this  purpose  we  introduce  an  auxiliary  index  set 
T 


:=  (1,2 n },  T AT„  = 0, 

l.  z 

(t~th  unit  vector  in  F ),  b =0  for  t6T 


and  vectors  y :=  e 
J t t 


2 " 


If  necessary 


we  complete  (y  ) ,-s.  to  a basis  of  R by  adding  vectors 

t ttl(w  ) 


y with  t€-T2. 


I T®  ' = 


= n has  the  advantage 


The  property 

that  the  linear  equations  we  have  to  solve  at  each  step  have  a 
unique  solution, and  therefore  it  is  not  necessary  to  solve 

s 

auxiliary  problems.  On  the  other  hand  the  maximal  values  a are 

not  strong  monotone  increasing  and  it  may  be  that  cycles  appear 

To  avoid  this  disadvantage  we  introduce  auxiliary  target  vec- 
s . 

tors  (u.),  . , linearly  independent,  such  that  the  se~ 

l I < i < n 

s — s — s . 

quence  (a  , a,,...,  a ) is  strong  monotone  increasing 

l n s 6rJ 

in  the  sense  of  the  lexicographical  order. 

In  the  s-th  iteration  we  have  the  following  basic  situa- 
tion: 


for 

each 

ie  I : = 

{ 1 , 2, . . ,n}  we  have 

and 

for 

each  p6l 

(s)  - „n 
vectors  u tF  . 

P 

* 

s 

:=  f i 6 I 

t;6Tk},  k-0,1,2; 

s . . 

yi 

s 

:=  yts,  b 

, S . - 

. :=  b s . ^1; 
i 

T 

:=  tT  i6I). 
i 1 

(Al) 

(y‘I 

hei  is 

_n 

a basis  in  R ; 

(A  l») 

(us 

p 

Vfi  is 

a basis  in  Rn. 

For 

i e i 

i 

exists 

r wi  - ° 

s 

w . with 

l 

for  ier?  , 

(A2) 

1 

wS.  = 0 
k-  i 

1 

for  i61‘2  • 

(A3) 

U = 

Z w? 

iei 

S 

yi  * 
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For  i€I.  p£I  there  exists  w?  6.  ]R  with 
y i,P 

(A2Q  A i € if  V 0 < p < n (w.  >0/\.Ao<p’  <p  wf  , 

1 — — l , p — i > p 

(A31 *)  A p 6 1 uS  = E wf  yf  . 

P iei  i,P  yi 

Now  we  define  index  set6 


0)°; 


{ i £ I 


wf  >0*Ao<p'<P  wf  , = 0}  . 

itP  - i.P 


{Js} 

0 i 

is  a 

p 

p 

1 n 

s , 

w.  for 

(A4) 

-s 

w 

: = « 

l 

t 

0 else 

(A5) 

-s 

w 

: = 

(wS) 

V Vt6T 

<0 


is  a feasible  point  for  (D) . 


Starting  from  this  basic  situation  we  proceed  in  the  following 
way: 

according  to  the  characterization  theorem  2,  we  solve  the  line- 
ar equations 


(A6)  A i ei  < y®,  xS  > = b? 

S -s  s 

and  set  a :=  E w^  b^  = < u,x  > . 

Now  compute  the  quantity 

g(xS)  :=  sup({|bt  - < y^x55  >||teTQ}  U{bt  - <yt»xS  >|t€Tj}) 
and  distinguish 

S S • • — s 

1.  g(x  ) 0,  then  x is  a solution  of  (P)  and  w is  a 

solution  of  (D)  according  to  theorem  2, 

2.  g(xS)  > 0. 

Then  x is  no  feasible  point.  We  select  an  exchange  parameter 
t*S£  T such  that 

dS  :=  vS(bt*s  - < yt*s>  xS  >)  > 0, 


I ) w? 

x,o 


s 

w. 

1 
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where 


3M, 


sign  (bt*g  - < y^s,  x >)  for  tT  e TQ 


for  tfSeT, 


Because  of  condition  (Al)  the  equations 


i € I 


s s *-s 

c . y . = v y 

1 J 1 J 


, iei. 


have  a unique  solution  vector  (c.). 

1 l 6 I 

We  distinguish  two  subcases: 


A ier 


c?  < 0 a A i6I«  C?  = 0. 

i — 2 l 


Then  a_  = a = °°  and  (P)  has  no  feasible  points. 

b.  V i 6 1^  c?  0 V V i6l®  cf^O. 

• s s 

Then  we  replace  a certain  parameter  t £1  by  a parameter 
t*SGTUT2.  For  th|t  define 


w. 


S+1  • r 1 I ✓ • tS  S ./•  TS  Sy^\T 

r :=  min  {—  |(i£Ij  A c.  > 0)  V (i£  12  ^ °)  ) 

c . 

l 

Now  select  an  index 


s 

i*  i*s6l's  liei'ulMcNo^-  rs+l> 

1 2 1 S 

c . 

s+1  1 

and  vector  system  (u^  ) ^ such  that  conditions  (Al)  to 

(A3')  are  satisfied  with  s replaced  by  s+1  and 


s+ 1 
t. 

for 

i6lMi*s  }, 

1 

(0 

:= 

f 

1 

X 

[t*s 

for 

.#s 
i = i 

(2) 

A i 

IP’ 

1 p s 

s+1 

V 

:*  us,  , where 

p 

i 

P*S  : 

-■< 

r 

p 

for 

i*s6Jsp  , 

0 

for 

i &i2 

< 

(3) 

i*s^ 

‘o 

9 
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S+l 

(4) 


'S  S+l  S , . , _ . r .J*Si 

w^  - r c ^ for  1 g I \ { iT  } , 


s+ 1 s 
r v 


for  1=1 


There  are  many  possibilities  to  satisfy  conditions  (1)  to 
(4).  For  example  if  {iGl^  | c?  t 0}  ^ 0: 

■jk  ^ s s s 

select  i :=  i £ 1 with  c^  J1  0 arbitrarily.  Let 

{lj,...,!^}  :=  iifelj  I = 0}  , w^  defined  by 


1 


(4)  (i j < ij,  for  j < j’), 


s+l  s+l 

Then  set  u :=  y. 

P *i 


s+ 1 . 


for  I < p < k and  complete  (u^  ) 


to  a basis  of  Rn  ^ by  proper  elements  of  Rn. 

If  {ieiS2  | cf  * 0}  =0: 

let  p*  :=  p*S  :=  max  {p6lO(0}jV  i£J 
Select  i*  = if S£  with  c?*>  0 and 

pT  1+ 


I <p<k 


IS 

P 


c.  > 0} 

l 


s s 

— u—  :=  min  { - - +■ 
s s 

c . 

l 


ci* 


iejs*.,  C?  > o} 

pr  l 


Let  { i j , . . . ,ik) 


(ie  LJ 


D*< 


p < n 


T s I s+l 
J w.  = 0} , 
P 1 i,p* 


Let  u 


s+l 


for 


s c . *s  s+l  s+l 

u for  1 < p < p^  , u*.  :=  y. 

P P ~ s+l  I*+P  Lp 

1 < p < k and  complete  (u  ).  ^ . to  a basis  of  Hn. 

_ P ' _ P Pr‘*’k 

In  any  case  conditions  (1)  to  (4)  are  satisfied. 

To  start  the  algorithm  we  need  a starting  point 
w1  :=  ^wt^t£T"  According  to  the  construction  of  the  algo- 
rithm we  can  start  with  a degenerate  feasible  basic  solution. 
Let  a 6®  such  that  a < a < a = inf  {<  u.x  > I x6M}. 

Then:/\  x£M,  < x,u  > >a.  Let  o£T.  and  y :=  u,  b :=  a , 

— o 1 }o  o o 

Then 


for 


-I 

w 

t 


0 else 

is  a feasible  (degenerate)  basic  sulution  for  (D)  and  we 
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start  the  algorithm  with  w' : = (w^) . 

THEOREM  6.  (Convergence): 

Let  (V)  be  satisfied,  R bounded  and 

(i)  V yfe(0,l)A  seW  dS  £ yg(xS)  > 0 or 

(ii)  V (n)  ,-u  C W+»  lim  nS  = 0: 

new 

S-X» 

.s  / s.  s . ,s  _ 
d >_  g(x  ) - n and  d >0. 

Then  the  algorithm  is  convergent. 

The  proof  is  very  long  and  complicated  and  will  be 

published  elsewhere. 

4 Applications 

a.  Parabolic  boundary  control  problem 
The  problem  of  optimal  heating  of  a metal  in  a furnace  leads  to 
the  following  optimal  control  problem  (see  K.  Glashoff  and  W. 

Krabs  [2]).  Given  z€C[0,l],  find  a control  u6L  10,T], 

00 

0 <_  u(t)  <_  1 a.  e.,  and  a sulution  y of  the  boundary  value 
problem  2 

4-  y(x,t)  — — y(x,t)  = o,  (x,t)  e(o,  l)x(o.t), 

3t  3x2 

a £ yd.t)  + y(l,t)  = u(t),  0 < t < T, 

^ y(o,t)  =o  , o < t < t, 

y (x,0)  =0  , 0 < x < 1 , 

such  that 

max  | z(x)  - y(x,T) | 

0<^x£l 

has  minimal  value  £.  This  problem  has  a solution  and  in  the 
case  a > 0 , the  corresponding  optimal  control  u is  bang-bang 
with  a finite  number  of  switching  points  in  [0,T-e]  for  each 
e > 0.  Therefore  we  partition  the  interval  [0,T]  into 
subinterval 8 of  equal  length  and  consider  control  functions 
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u,  constant  on  each  subinterval.  The  remaining  finite  dimen- 
sional Tschebyschef f approximation  problem  under  side  condi- 
tions is  solved  with  the  algorithm  in  the  case  a = 0,1, 
z(x)  = 0,2  and  T = 0,2,  T = 0,3. 


b.  Hyperbolic  boundary  control  problem 
We  consider  a vibrating  string  of  length  one,  kept  fixed  at  the 
left  hand  side  and  controlled  by  a force  at  the  right  hand  side. 
The  string  has  a given  initial  state  and  we  intend  to  control  it 
such  that  the  whole  energy  at  a fixed  time  T is  minimal.  This 
problem  is  described  by  the  following  mathematical  model: 

Find  a control  function 

v£V  :=  (veC1  A [0,T]  |v(0)  = v' (0)  = 0,  ||v"|L  < 1> 


and  a solution  y(x,t;v)  of  the  boundary-initial  value  problem 
2 2 

~2  y(x»t) ^2  y<x’c>  = °»  (x»t)  €(0,  l)x(0,T)  , 

9 1 9x 


y(x,0)  =»  yQ(x) 

y(x,0)  =*  y j (x)  w 

y (o, t)  * o ") 


y(l,t)  * v ( t ) J 

1 3 2 

such  that  / [ (r-  y(x,T;v))  + 

0 9t 


x€tO,l], 


t eto.T], 

(■^  y (x, T; v) ) 2 ] 


dx 


has  minimal  value  a. 

When  y y are  proper  functions,  this  problem  has  a 
solution  with  £ 4 0 for  T < 2 which  satisfies  a weak  bang- 
bang  principle  (see  W.  Krabs  [3]).  One  can  solve  the  differen- 
tial equation  problem  explicitly.  Then  we  have  to  minimize 
+ l 2 

/ G'  (T  + x)  dx  over  v£-V  and 
. v 


4 

J 
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(T  + x)  := 


G1 (T+x) 


G' (T+x-2)  + v' (T+x-1 ) 


for  T + x 1 , 
for  T + x > 1 , 


G(x)  :=  j (yQ(x)  + / y^sjds). 

o 

For  computational  purposes  we  partition  the  interval  [0,T] 
into  subintervals  and  select  a subspace  of  V with  a basis 
consisting  of  suitable  hat  functions.  Then  we  solve  a quadra- 
tic program  with  a finite  number  of  side  conditions.  Numerical 
results  will  be  presented  elsewhere. 
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R.  B.  Holmes 

This  paper  discusses  some  approximation  theoretic  prop- 
erties and  applications  of  M-ideals  in  Banach  spaces.  Several 
open  questions  are  raised. 

Let  M be  a closed  linear  subspace  of  the  Banach  space 
X.  M is  a summand  of  X (resp.  an  ideal  in  X)  if  it 
(resp.  its  annihilator  M°)  has  a complementary  subspace  in 
X (resp.  in  X*).  All  summands  are  clearly  ideals,  but  the 
converse  is  false. 

We  are  going  to  consider  a special  kind  of  ideal.  Recall 
that  an  L-projection  (resp.  an  M-projection)  on  X is  a 
(bounded  linear)  projection  P satisfying  ||x||  = ||Px||  + ||x  - Px|| 
(resp.  ||x||  = max(||Px||,  |jx  - Px||)),  for  x e X.  Then  an 
L- summand  (resp.  an  M- summand)  is  the  range  of  an  L-projection 
(resp.  of  an  M-projection). 

LEMMA  1.  The  following  properties  of  the  subspace  M are 
equivalent; 

M°  is  an  L- summand  of  X* ; 

M°°  is  an  M-summand  of  X**; 

there  is  a projection  P:X*  + M°  such  that  the  unit  ball  U* 
splits;  U*  = co(P(U*)  U (I  - P)(U*)). 

The  proof  can  be  read  out  of  [1,2].  A subspace  of  X 
with  any  and  hence  all  of  these  properties  is  called  an  M- 
ideal  in  X.  Using  the  natural  duality  between  L-  and 
M-projections,  it  is  easy  to  see  that  every  M-summand  is  an 
M-ideal,  and  that  the  converse  is  true  in  reflexive  spaces  X. 
Since  any  L-projection  is  obviously  a metric  projection,  the 

Preceding  page  blank 
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annihilator  of  an  M- ideal  must  be  a Chebyshev  subspace.  Con- 
sequently, by  a well-known  theorem  of  Phelps,  an  M-ldeal  has 
the  unique  Hahn-Banach  extention  property. 

A variety  of  examples  of  particular  M-ideals  is  given  in 
[1].  Of  special  interest  is  the  fact  [4]  that  the  space  of 
compact  operators  on  certain  Banach  spaces  X (e.g.,  cq,  £p, 

1 < p < oo)  is  an  M-ideal  in  the  corresponding  space  of  bounded 
operators.  The  characterization  of  such  spaces  in  general  is 
an  open  question. 

LEMMA  2.  Let  M be  an  M-ideal  in  X.  Then  for  any  p > 0 


(1)  (1  + p)U  0 (U  + M)  «=  U + pU(M). 

Proof . (Adapted  from  [2]).  Let  x belong  to  the  left  hand 
side  of  (1),  and  let  Q:X**  \ M°°  be  an  M-projection.  Then 
Qx  e (1  + p)U(M°°).  There  hence  exists  w e pU(M°°)  such 
that  Q(x  - w)  = Qx  - w e U**.  Then  (I  - Q) (x  - w)  = (I  - Q)x 
also  belongs  to  U**,  whence  x - w e U**.  That  is, 

x = (x  - w)  + w e (U**  + p(U(M  ))  n XC  U + pU(M)  Cl  X 

= U + pU(M)\  thus  proving  (1). 

COROLLARY  1.  An  M-ideal  is  proximinal. 

Proof.  For  any  closed  subspace  M of  X let  Qw  be  the 
M 

quotient  map.  Then  M is  proximinal  iff  Q„(U)  is  closed 

in  X/M.  Now  suppose  that  M is  an  M-ideal  in  X and  that 

Qm(x)  e Qm(U)  . We  shall  find  y e U so  that  QM(x)  = QM(y)  .. 

Now  x e U + M so  by  (1)  x e U + pU(M)  for  any  p >_  ||xj|  - 1. 

Hence  there  exists  x^  e pU(M)  such  that  ||x  - xj|  < 1 + 1/2. 

Then  x - x^  e U + 1/2U(M)  and  so  there  exists  x^  e 1/2U(M) 

such  that  || x - x^  - x^H  < 1 + 1/4.  Continuing  inductively 

we  produce  a sequence  {x  }„  with  x e 2^  "uCM)  and 

n 2 n 

||x  - x^  - ...  - x ||  < 1 + 2 n.  Therefore,  if  we  set 


392 


M-IDEALS  IN  APPROXIMATION  THEORY 


y = x + I x , we  see  that  ||y||  <_  1 and  y - x e M,  qed. 

1 n 

This  result  was  proved  by  different  methods  in  [1].  In 
[2]  it  was  shown  that  this  argument  can  be  refined  to  further 
yield  ]|y  - x||  <_  ||x||  - 1 + e,  for  any  given  e > 0.  From  now 
on  we  let  PM  be  the  set-valued  metric  projection  on  M. 

In  [7]  the  following  result  on  simultaneous  approximation 
was  established. 

LEMMA  3.  Let  M be  an  M- ideal  in  X and  let  K be  a 

compact  subset  of  X \ M such  that  0 < diam  (K)  < e < 

dist  (K,M) . Then  given  y e K and  a best  approximation 

z e Pw(y),  there  exists  "z  eD{Pw(x):x  e K}  with 
M M 

||z  - z||  < 2e. 

This  result  leads  to  the  Lipschitz  inequality 

(2)  <Vpm^’  pM^y^  -2IIX  " yll*  x«y  e x- 

Here,  d is  the  Hausdorff  metric  on  the  nonempty  closed 
H 

bounded  subsets  of  M,  and  the  estimate  (2)  is  sharp  [7]. 

We  now  consider  a second  and  perhaps  more  surprising  con- 
sequence of  Lemma  3.  We  say  for  brevity  that  a proximinal 
subspace  M is  anti-Chebyshev  if 

(3)  span  P (x)  = M,  x e X \ M. 

M 

THEOREM  1.  An  M- ideal  is  anti-Chebyshev. 

Proof . By  Corollary  1 any  M-ideal  M is  proximinal.  Now 
for  a given  x e X \ M let  V = span  P^(x)  and  select  any 
y e M.  Apply  Lemma  3 to  the  set  {x,  x + ay}  for  sufficient- 
ly small  a > 0.  If  w e P„(x)  fl  Pu(x  + ay),  then  w = ay  + z 
for  some  z e P^(x) . Solving  for  y then  shows  y e V,  q.e.d. 
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See  [7]  for  further  consequences  and  discussion  of  prop- 
erty (3)  . 

If  M is  an  ideal  in  X there  is  an  operator 
T:X  -*■  X**  such  that  T|M  is  the  identity  injection  and 
T(X)  c M°°.  If  M is  actually  an  M- ideal  then  Lemma  1 can 
be  employed  to  show  that  T also  satisfies 

(A)  ||x||  = max(dist(x,M) , ||lxj|),  x e X. 

0 

If  M is  the  metric  complement  of  M it  follows  from  (4) 

0 

that  T is  norm  preserving  on  the  open  set  X \ M and 

6 6 
that  M contains  the  subspace  ker(T).  Of  course,  M 

is  never  itself  a subspace  on  account  of  Theorem  1. 

0 

If  M is  an  M-ideal  then  M can  fail  to  have  nonempty 
iterior  [7],  but  it  will  if  M is  an  M-summand.  Does  this 
property  in  fact  characterize  M-summands  among  M-ideals? 

Examination  of  the  proof  of  Theorem  1 and  that  of  (2) 
in  [7]  shows  that  both  these  approximation  properties  of 
M-ideals  depend  only  on  the  "2-ball  property"  of  [1],  Now 
the  2-ball  property  is  not  characteristic  of  M-ideals  and 
as  far  as  we  know  this  class  of  subspaces  has  not  been  satis- 
factorily described.  Must  a subspace  with  the  2-ball  prop- 
erty necessarily  be  an  ideal? 

Property  (2)  is  known  to  hold  for  other  subspaces  be- 
sides M-ideals.  For  example,  it  is  valid  for  the  subspace 
of  continuous  functions  in  the  (real)  space  of  bounded  Borel 
functions  on  a paracompact  topological  space  [5,  p.  173], 
However,  although  an  ideal,  this  subspace  lacks  the  2-ball 
property.  In  contrast,  this  subspace  need  not  satisfy  prop- 
erty (3) . In  fact,  it  can  happen  that  some  bounded  functions 
have  unique  best  continuous  approximants . From  this  evidence 
our  anti-Chebyshev  property  appears  eery  restrictive. 
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M-ideal  theory  and  techniques  provide  a powerful  and 
unified  approach  to  compact  operator  approximation.  To  il- 
lustrate this  assertion  we  mention  a recent  result  due  to 
Chui  et  al.  [3].  Let  H be  a Hilbert  space  and  let  B(H) 
(resp.  C(H))  be  the  algebra  of  all  bounded  (resp.  compact) 
operators  on  H. 


THEOREM  2.  For  all  T e B(H) 
A scalar)  ^ 0. 


we  have  + AI): 


Thus,  there  is  a compact  operator  of  simultaneous  best 
approximation  to  the  line  {T  + AI).  It  follows  that  given 
T e B(H)  there  exists  K e C(H)  such  that  ||p(T  + K)||  = 
||p(T)|!  = dist(p(T),  C(H)),  for  all  linear  polynomials  p. 

Whether  such  a K can  be  found  for  higher  degree  polynomials 
is  an  open  question. 

As  has  also  been  observed  in  [10],  this  compact  opera- 
tor K has  a further  remarkable  property.  To  state  it,  let 

W (T)  be  the  algebra  numerical  range  of  T [9],  and  let 
o 

Wg(T)  = Wq(T  + C(H))  be  the  essential  numerical  range  of  T. 

COROLLARY  2.  For  each  T e B(H)  there  exists  K E C(H) 
satisfying  W (T)  = Wq(T  + K) . 

Proof.  Since  We(T)  =n{WQ(T  + K):K  e C(H)),  we  certainly 
have  inclusion  from  left  to  right.  Now  if  K is  the  compact 
operator  of  Theorem  2,  and  if  t c Wq(T  + K) , then 
|t  - A | <_  ||T  + K - Al|[  = || T - Al||e  for  all  A,  proving  that 
t e (T) , q . e.d . 

This  corollary  derives  some  of  its  interest  by  contrast 
with  the  recent  result  c (T)  = o(T  + K)  for  T e B(H)  [8]; 
here  c (T)  = 0{a(T  + K):K  e C(H)}  is  the  Weyl  essential 
spectrum  of  T. 

It  is  not  clear  what  operators  other  than  I can  serve 
in  Theorem  2.  We  are  in  effect  asking  what  lines  belong  to 
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0  6 

the  metric  complement  C(H)  . Since  the  contents  of  C(H) 

are  only  incompletely  known  [6]  this  will  be  a difficult  ques- 
tion to  answer  in  any  generality. 
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L.  Jaffe 

1 Rolle  regularity 

In  this  paper  we  discuss  problems  of  regularity  of 

Birkhoff  interpolation  matrices,  and  introduce  the  new  notion 

of  Rolle  regularity.  The  matrix  E = (e.,  )?  , f „ has 

ik  1=1  k=0 

elements  e^k  that  are  zeros  or  ones,  and  has  exactly  n + 1 
ones.  Let  [a,b]  be  a given  interval,  let  X:  a < ...  < 

x^  <_  b be  a set  of  knots  in  [a,b].  The  pair  E,  X is 
regular  if  the  interpolation  problems 

(1)  P(k)(xi)  « cik,  e.k  = 1, 

have  unique  solutions  in  the  space  of  polynomials  P of 
degree  n.  The  matrix  E is  regular,  if  E,  X is  regular 
for  each  X.  In  what  follows,  we  use  the  terminology  of 

[2] ,  [4],  [6]  without  explanation. 

DEFINITION  1.  A pair  E,  X is  Rolle  regular  if  for  each 
f £ Cn[a,b],  conditions 

(2)  f(k)(x1)  = 0,  eik  = 1, 

imply  the  existence  of  a £,  a < £ < b with  the  property 

(3)  f(n)  (O  = 0. 

The  matrix  E is  Rolle  regular,  _if  E,  X is  Rolle  regular 
for  each  X. 
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A matrix  E is  conservative  (Birkhoff  [1])  if  it  has 
no  odd  supported  sequences. 

THEOREM  1 . A conservative  matrix  which  satisfies  the  Birkhoff 
(strong  P61ya)  condition  is  Rolle  regular. 

This  follows  from  the  usual  proofs  of  the  Atkinson- 
Sharma  theorem  (see  [4],  [5]),  which  give  also  the  fact  (not 
mentioned  there)  that  the  possibilities  £ = a,  b can  be 
eliminated. 

Of  great  importance  in  Birkhoff  interpolation  is  the 
determinant  D (X)  of  the  system  (1),  and  the  Birkhoff  kernel 

L> 

Kp(X,  t) , a <_  t <_  b (see  [1],  [3],  [4]).  We  recall  the  fol- 
lowing facts: 

b 

(4)  / KE(X,t)dt  - De(x); 
a 

b , v 

(5)  If  f e Cn[a,b]  satisfies  (2),  then  J i n 'Kdt  = 0; 

a 

b 

If  g e C[a,b]  has  the  property  J gKdt  = 0,  then  there 

a 

(6)  exists  a function  f e Cn[a,b]  which  satisfies  (2)  and 
has  f^n^  = g. 

DEFINITION  2.  (Birkhoff  [1]).  An  interpolation  matrix  E 
is  simple,  if  for  each  set  of  knots  X,  the  kernel  K^X.t) 
is  of  constant  sign,  and  does  not  vanish  identically. 

In  this  case,  K(X, t)  is  of  the  same  sign  for  all  X,t. 
Indeed,  let  K(X',t')  > 0,  K(X", t")  < 0.  We  may  assume  that 
all  sets  of  knots  X in  question  have  the  same  first  and 
last  elements  x^  < x^.  For  each  X,K(X, t)  is  a continuous 
not  identically  vanishing  function  on  [x^,xm],  which  has  a 
constant  sign,  and  it  is  positive  if  X = X',  negative  if 
X = X".  Changing  X continuously  from  X'  to  X",  we  obtain 
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a contradiction. 

THEOREM  2.  Let  E be  a regular  matrix.  (i)  For  a given 
set  of  knots  X,  the  pair  E,X  is  Rolle  regular  if  and  only 
if  Kj,(X,  t)  is  of  constant  sign,  (ii)  E is  Rolle  regular  if 
and  only  if  it  is  simple. 

Proof.  If  K is  of  constant  sign,  and  if  f c Cn[a,b] 
satisfies  (2),  then  from  (4)  and  (5), 

b . . b 

/ fWKdt  = 0,  / Kdt  De(X)  * 0. 

a a 

We  deduce  that  f^  cannot  be  strictly  positive  or  strictly 
negative  on  [a,b],  hence  it  satisfies  (3). 

If  K changes  sign  for  some  X,  then  there  is  a strictly 
positive  continuous  function  g with  / gKdt  = 0.  For  f 

cl 

defined  by  (6),  we  have  (2),  but  (3)  does  not  hold.  This 
proves  (i) . 

One  can  sometimes  reduce  the  requirements  of  Rolle 
regularity  to  some  subclasses  of  C^n\ 

THEOREM  3.  (L.  Jaffe  and  G.  G.  Lorentz)  Let  E be  a 
regular  matrix.  Assume  that  no  polynomial  P (of  any  degree) 
with  the  property 

(7)  P(n) (x)  = (x  - a)P(B  - x)q,  a < a,  b < B,  p,  q * 0,1,... 

can  be  annihilated  by  E,X.  Then  E,X  is  Rolle  regular. 

Proof . For  each  selection  a < a,  B > b,  p,  q = 0,1,...  we 
must  have  F (a,B)  + 0,  where 

pq 

b 

(8)  F (a, B)  - / (t  - a)P(B  - t)qK(t)dt. 

pq  i 

For  otherwise  (6)  would  give  us  a polynomial  P satisfying 
(7)  and  annihilated  by  E,X. 
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Without  loss  of  generality  let  D (X)  > 0.  Since 

£ 

(^)P(B),'Fpq<“-6)  ■ / (1  - * de(x)  * 0 

a 


for  a -*■  -°°,  3 -*•  +«°,  and  since  F (a,  6)  never  vanishes,  it 

pq 

follows  that  F (a,  3)  > 0 for  all  choices  of  parameters. 

pq 

If  cjc  L 0,  k = 0, . . . , N,  we  derive  from  this 
b N 


J Z c (t  - a)k(3  - t)N  ^(Odt  >_  0. 


a k=0 


If  g is  positive  continuous  function  on  [a, 6],  it  is  pos- 
sible to  select  the  constants  c.  so  that  the  sum  under  the 

k 

integral  sign  will  become  the  Bernstein  polynomial  BlT(g)  of 

b N 

g on  [<*,&].  For  N -*■  °°,  the  relation  f B (g)Kdt  > 0 

b a N 

becomes  f gKdt  >_  0.  Since  g was  arbitrary,  we  must  have 

cl 

K(t)  0 for  all  t in  [a,b]. 

In  terms  of  Rolle  regularity  of  matrices,  this  result 
has  the  following  formulation. 

THEOREM  3A.  A regular  m x (n  + 1)  matrix  E with  m 2, 
with  n + 1 ones  and  with  zeros  in  the  last  column  is 
strongly  regular  if  and  only  if  each  matrix  E1  derived  from 
E by  adding  to  it  rows  0 and  m + 1 with  ones  in  posi- 
tions (0,n),  (0,n+l),  ....  (0,n+p-l);  (m+l,n),  ....  (m+1, 
n+q-1),  p,q  = 0,1,...  is  regular. 

Regular  must  be  also  the  matrices  obtained  from  E'  by 
coalescing  row  0 to  row  1,  or  row  m+1  to  row  m (or  both) . 

We  note  two  more  results,  which  can  be  derived  from  the 
above.  For  a matrix  E,  let  E (or  E)  be  the  (m  - 1)  x n 
matrix  derived  from  E by  omitting  from  it  the  last  column 
and  the  last  one  of  the  first  (or  the  last)  row. 

PROPOSITION  1.  A Birkhoff  matrix  E cannot  be  Rolle  regular 
if  one  of  the  matrices  E,  IS  is  strongly  singular. 
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PROPOSITION  2.  Let  E be  a Blrkhoff  matrix  with  row  1 or 
row  m containing  at  least  two  ones,  or  let  E be  a three 
row  Birkhoff  matrix.  If  one  of  the  rows  of  E contains 
exactly  two  odd  supported  sequences,  one  of  them  terminating 
in  the  penultimate  column,  then  K(X, t)  changes  sign. 

2 Counterexamples 

For  our  main  properties  we  have  established  the  following 
implications: 


CONSERVATIVE  -==£>ROLLE  REGULAR <£=*> SIMPLE  ^REGULAR. 


The  first  and  the  last  implications  cannot  be  inverted: 
EXAMPLE  1.  The  matrix 


0 0 0 0 
0 0 10 
0 0 0 0 


is  regular,  but  not  Rolle  regular. 

The  regularity  of  E has  been  noted  in  [5].  We  take 
X = (0,  x,  1);  the  kernel  K^CX.t)  changes  sign  for 
0 <_  x <_  t <_  1.  This  follows  from  the  formula 

3 

K(X,t)  = - ^ {3x(l  + t)  - 1 - 3t} . 

(This  result  follows  also  from  Proposition  2). 
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EXAMPLE  2.  The  matrix 


is  Rolle  regular,  but  not  conservative. 

This  Birkhoff  matrix  has  two  odd  supported  sequences 
and  is  not  conservative.  We  show  that  it  is  Rolle  regular 
by  explicit  computation  of  its  kernel  K_(X, t).  The  proof 
that  K(X,t)  0 for  all  X, t is  elementary,  but  not  quite 
simple.  Without  loss  of  generality  we  can  take  x^  = 0, 
x^  = x,  x^  = 1.  From  the  determinant  expression  for  K(X,t) 
we  obtain 


Kg(X,t) 


Hence 


KgU.t) 


{ 


iy(x " 

1 4 
Ti* 

1 3 

If* 

i(x" 

< 

1 2 
2* 

X 

jt(1- 

‘>1 

1 

5! 

1 

4! 

1 1 
4 ! 5 ! 

(1  - t) 

5 5 , 

x for 

x < t < 

xt 
2*4 ! 5! 


j f(x,  t)  for  0 <_  t <_  x. 


where 


f(x,t)  - (20x2  - 25x  + 8)x2  - 4tx(5x3  - 5x  + 2) 

2 4 14 

+ t (lOx  - 5x  + 2)  - 2t  x . 
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Evidently  Kg(X)  ^ 0 and  also  ^(X,  t)  <_  0 for  x <_  t <_  1. 

To  show  that  this  inequality  holds  also  for  0 t <.  x,  we 
use  critically  the  following  equivalent  expression  for  f(x,t) 


f(x, t)  = 2t  (1  - x)  + 2t(x  - t) (1  - 


x)4(x 


+ 4) 


+ (x  - t)2(-2x4t  + 20x2 


25x  + 8) . 


If  0 < t < x/2  we  have 

f(x,t)  >_  (x  - t)2(-x3  + 20x2  - 25x  + 8). 

Let  g(x)  = -x3  + 20x2  - 25x  + 8.  In  0 <_  x <_  1 the 

function  g attains  its  minimum  inside  the  interval  at  a 

4 

point  x where  x = 8x  - 5 and  0 < x < 1.  Hence  at  the 
minimum 


g(x)  = 12x2  - 20x  + 8 = 4(1  - x)(2  - 3x) . 

But  at  the  minimum  x = 5/8  + (l/8)x4.  Hence  x < 5/8  + 1/8  = 

3/4  which  in  turn  gives  the  better  estimate  x < 5/8  + 

l/8(3/4)4  < 2/3,  showing  that  g(x)  >_  0 for  0 <_  x <_  1. 

On  the  other  hand  if  (l/2)x  t <_  x we  have  t >_  x - t. 

Hence 


f(x,t)  > (x  - t) 2 [ 2(1  - x)5  + 2(1  - x)4(x  + t) 
-2x4t  + 20x2  - 25x  + 8] 


and  it  suffices  to  show  that 


h(x) 


10(1  - 

*>*- 

2x5  + 

2 

20x  - 

25x  + 

8 

2(1  - 

x)5- 

20x3  + 

2 

60x  - 

55x  + 

16  > 0 
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for  0 £x  < 1.  In  turn  it  suffices  to  show  that 

k(x)  = -20x^  + 60x^  - 55x  + 16  ^ 0 for  0 <_  x <_  1. 

2 

The  function  k is  at  a minimum  where  x = 2x  - 11/12 
at  which  point  k(x)  = (IQ^x  - 7/10).  The  minimum  occurs 
at  x = 1 - /l/12  = 0.712  which  is  greater  than  7/10! 

This  completes  the  proof  that  K„(X)  does  not  change 

hi 

sign  and  hence  that  E is  Rolle  regular. 
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BIRKHOFF  INTERPOLATION  BY  SPLINES 
K.  Jetter 

This  paper  deals  with  regularity  theorems  concerning  the 
problem  of  Birkhoff  interpolation  by  splines.  In  particular, 
it  is  shown  how  the  Polya  condition  can  be  extended  to  spline 
interpolation.  A duality  theorem  and  a generalized  Atkinson 
and  Sharma  theorem  are  treated  in  detail. 

1.  Introduction 


Let  k > 0,  m > 1,  n > 0 be  fixed  natural  numbers,  X := 
{-1  = xQ  < Xj  < . . . < xn+1  = +1}  and  X*  :=  {-1  = x*  < x*<. . .< 
x*^^  = +1 } fixed  real  point  sets  and 


E = 


n+1  m-1 
i=0  j=0 


E*  = 


e*  ) 

p.q  p= 


m-l 

q=0 


fixed  incidence  matrices  (i.e.,  e.  ,,  e*  =0  or  = 1).  The 

i,j  p.q 

problem  of  Birkhoff  interpolation  by  splines  consists  in  finding 

conditions  on  the  regularity  of  (E,X,E*,X*):  Does  there  exist, 

for  every  real  data  c.  . , a unique  spline  s e S, 

1 . J 


S :=  <l,...,tm'1,...,(t-x*y"'1"q, 
L E_  + 


). 


e = 1 

solving  the  equations  s^  (x  ) = c for  e.  . = 1? 

1 i . J 1 » J 

It  is  natural  to  assume  that  the  number  J E | of  interpo- 
lation conditions  is  equal  to  the  dimension  of  S, 

(Al)  |e|  = m + |e*| , 

and  in  order  to  have  well-defined  derivatives  s^^ 

= 1 , we  require  that 


(x.)  for 


i.J 

(A2 ) x = x*  =>  e 


i.J 


+ e 


>,m-l- j ^ lf  J ~ °«--* 


,m-l . 
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2 . The  generalized  P($lya  condition 


In  the  special  case  of  polynomial  interpolation  (where 
|E* | =0)  the  Polya  condition  is  well  known  (see  Schoenberg 
[7]).  In  a possible  formulation  this  condition  requires  that 
no  subblock  of  interpolation  conditions  may  produce  an  overde- 
termined interpolation  problem.  Consider  a triple  (r.i^,^), 

0 < r < m-1,  0 < i^  < < n+1,  and  let 


E = 


E*(r,ix,i2) 

1 

■- 

i 


1 

2 


m-l-r 


where 


c < x, 

pi'  *i 


< X*  , x* 
Pl+l  p2- 


1 < X < X 

1 i2  - p2 


With  respect  to 


the  basis 


1, 


tm-l 


i > 

u 


. * .m-l-q 
’ (m-l-q) ! ’ 


■k  * 
X < X 

P “ Pq 


V*  (m-l-q ) ' > 

* . * 
x > x 


of  S the  coefficient  matrix  associated  with  (E,X,E*,X*) 


takes  the  form  A = 


A„ 

2 

0 

A, 

1J 

if  rows  and  columns  of  A are 


ordered  as  follows:  The  lower  rows  are  associated  with  inter- 

polation conditions  characterized  by  e e E(r,i  ,i  ),  and 

the  columns  at  the  back  belong  to  the  basis  functions  t^  / j ! , 

r<  j < m-1,  and  (t-x*)™  1 q/(m-l-q)!  where  e*  € E*(r, i.  ,i.  ). 

p + p,q  12 

If  Aj  contains  more  rows  than  columns  then  A cannot  be  regu- 
lar. We  have  shown  (see  Melkman  f 6 J for  another  proof  in  the 
case  of  "Hermite"  splines) 

THEOREM  1 . The  following  generalized  Polya  condition  is 
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necessary  for 


( E , X , E* , X* ) to  be  regular: 


| E(r , i.  , i ) | < m - r + | E*(r , i. , i? ) | 

(A3) 

for  all  (r,i1>i2),  0 < r < m-1,  0 < < i2  < n+1. 


It  is  obvious,  too,  how  D.  Ferguson's  [2]  decomposition 
theorem  carries  over  to  spline  interpolation.  Call  the  problem 
(r , i^ , i2 )-decomposable , if  (r,i^,i2>  4 (0,0, n+1),  and  if  equal- 
ity holds  in  (A3).  Then  the  matrices  A^  and  A2  are  square 
and  may  be  interpreted  as  the  coefficient  matrices  of  two  easily 
identified  interpolation  problems  (J^)  and  (J2)* 

THEOREM  2.  An  (r , i ^ , i^ ) -decomposable  problem  (E,X,E*,X*)  is 
regular  if  and  only  if  the  induced  problems  (J^ ) and  (J2 ) are 
regular. 


3.  Regular  problems 


Some  regularity  results  can  be  proved  using  the  properties 
of  interpolation  kernels.  Recall  that  every  regular  problem 
(E,X,E*,X*)  is  characterized  by  its  kernel  k(x,t)  = det  A ^ 
det  A(x , t ) , 


A(x,t) 


Here  A is  the  coefficient  matrix  of  (E,X,E*,X*)  with  respect 
to  the  basis  of  S indicated  by  the  first  row. 

Our  first  result  is  a "duality"  theorem.  We  call  (E*,X*, 
E,X)  dual  to  (E,X,E*,X*)  if  E*  is  obtained  from  E*  by  adding 
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a new  first  row  (eQ  Q,...,e*  m_^)  an^  a new  last  row  (e*+^  q, 

. . . , e* , , , ) where  e,.  . + e*  , . = e . . + e*  - 1 . = 1 , 

k+1 ,m-l  0,j  0 ,m-l - j n+l,j  k+l,m-l-j 

j = 0,...,m-l,  and  if  E is  obtained  from  E by  discarding  its 

first  and  its  last  row. 


THEOREM  3.  (E*,X*,E,X)  is  regular  if  and  only  if  (E,X,E*,X*) 

is  regular. 


Proof.  Let  k(x,t)  be  the  interpolation  kernel  associated  with 
(E,X,E*,X*).  Define  k*(x,t)  on  the  open  square  (-1 ,+l )« ( -1 ,+l ) 
by  k*(x,t)  :=  ( -1 )mk(t ,x) . Then  k*(x,t)  is  essentially  the  in- 
terpolation kernel  associated  with  (E*,X*,E,X).  For  details 
see  [3] . 


We  shall  now  prove  a generalization  of  the  Atkinson  and 
Sharma  theorem  (see  [1]).  For  the  concept  of  odd  supported  se- 
quences consult  Lorentz  and  Zeller  [5]. 

THEOREM  4.  Assume  that  (E,X,E*,X*)  satisfies  (Al),  (A2),  and 
the  generalized  P61ya  condition  (A3).  I_f  E contains  no  odd  sup- 
ported sequence,  and  if  E*  contains  only  sequences  beginning  in 
the  first  column,  then  (E,X,E*,X*)  is  regular. 

Melkman  [6]  gives  a proof  of  this  theorem  in  the  case  of 
simple  spline  nodes  using  a Budan-Fourier  theorem  for  "Hermite" 
splines.  His  paper  includes  some  hints  for  treating  the  gener- 
al case. 

The  following  proof  using  an  induction  on  J E | proceeds 
from  a proof  of  the  Atkinson  and  Sharma  theorem  due  to  Lorentz 
[4].  We  may  assume  that  |e*|  > 0 and  | E | > 2. 

If  the  problem  (case  I)  is  (0, i^ , i^ )-decomposable  or  (case 
II)  is  (r ,0,n+l )-decomposable  with  E*  = [E*(r,0,n+1)  | 0 ] , we 
can  argue  with  the  decomposition  theorem,  Theorem  2,  using  the 
induction  hypothesis.  Therefore  assume  that  the  cases  I and  II 
are  excluded.  Then  it  can  be  shown  that  there  exist  positions 
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(i  ,0)  and  (pn,qn)  with  e n = e*  „ =1  such  that  the  re- 

U u u 10,u  ^0*^0 

duced  problem  (E,X,E*,X*)  which  is  obtained  from  (E,X,E*,X*)  by 

setting  e = e*  = 0 is  regular  according  to  the  induction 

i0.°  Po’^o 

hypothesis . 

Consider  the  interpolation  kernel  k(x,t)  associated  with 

(E,X,  e!*,X*),  and  put  kn ( t ) :=  k(x.  ,t).  Then  (E,X,E*,X*)  is 

L0 

(qo)  * 

regular  if  and  only  if  k (x  ) 4 0. 

U p0 

Kernels  of  this  type  have  been  studied  in  [3].  There  is  a 

minimal  interval  [x.  ,x.  ],  x,  < x.  < x.  , in  the  exterior 

1l  12  1\  X0  X2 

of  which  k^  vanishes  identically  and  in  the  interior  of  which 
kg  vanishes  only  in  discrete  points.  Using  an  estimation  of 
zeros  (see  Lorentz  [A],  or  [3])  k^  has  at  most  |E(0,i^,i  )|- 
m - 1 zeros , hence  by  (A3 ) at  most  | E*  (0,  i^  , ^ ) | - 1 zeros . 

But  if  ep,q  = 1 and  (p,q)  ^ (p0’q0^’  then  ko"<*V  = 0 

( Q ) 'k 

as  the  determinant  defining  k'^  (x  ) has  two  columns  differ- 

0 p 

ing  at  most  in  the  sign.  So,  k^  has  exactly  |E  (0,i^,i  )|-1 

zeros,  and--as  (p^jAn)  must  be  at  the  end  of  a sequence--x* 
00  p0 
is  contained  in  the  open  interval  (x.  ,x.  ).  Moreover,  we 

(q0}  * ^ "2 
must  have  k^  (x^  ) 4 0.  This  completes  the  proof. 
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NONUNIQUENESS  OF  SIMULTANEOUS  APPROXIMATION 


BY  TRIGONOMETRIC  POLYNOMIALS 
Darell  J.  Johnson 

We  consider  the  approximation  of  differentiable  func- 
tions by  trigonometric  polynomials  of  degree  at  most  n,  T , 

n 

in  the  norm 


max 


i=0,  . . . ,p' 


I f (ki)| 


where  0 = < . . . < k^  and  ||  .]|  is  the  usual 

Chebyshev  norm  on  C(T).  The  result  of  interest  obtained 
below  is  the  determination  of  the  set  of  best  trigono- 
metric polynomial  approximations,  in  the  sense  that  once 
one  best  approximate  is  known,  all  best  approximates  are 
known. 

More  specifically,  let  F ={kQ, k^, ... ,k^ },  where 

0 = kn  < k < ...  < k are  integers.  Let  B be  the  Banach 
U J-  p F 

space  of  kp-times  continuously  differentiable  real-valued 
functions  on  the  unit  circle  T = [— tt , tt  ] under  the  F-norm 
defined  as  above. 

By  standard  compactness  arguments,  given  f e B 

F 

there  always  exists  at  least  one  q*  e for  which 
llf  _ q*llF  ±llf  ~ Pll  for  alL  <1  e Tn<  Thus,  n(f)  = 

(P  e rn;  II f - p|If  1 II f - q|lF  for  all  q e is  non- 

empty for  all  f e B^,.  Therefore  A ( f ) = ||f  - q*||,q*  e fi(f), 

is  well-defined,  and  positive  if  f l T . Define  extremal 

n 

sets 
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Ui(p)  _ {x  e [ — tt  , tt ) ; | ( f - p)  ^ki\x)  I = ]Jf  - p! i F } 

LEMMA  1.  There  exists  a q*  e (2(f)  having  the  property  that 
Ui(q*)  - Ui(q)  and  (q*)(ki)(x)  = q^ki^(x)  j^f  x e U1(q*),  for 
every  q £ ft  ( f ) and  every  i = 0 , . . . , p . 

DEFINITION.  A q*  e (;(f)  having  the  properties  that 
U^(q*)  c U. (q)  and  q*  agrees  with  q on  U^(q*),  for 
every  q £ ft ( f ) and  every  i = 0 , . . . , p , is  called  a minimal 
polynomial  of  best  approximation  to  f . 

To  prove  the  lemma,  note  that  each  L\(q)  is  compact. 

Set  = ('{U.(q);  q c (2(f) }.  Then  there  is  a countable 

CO 

sequence  {q  } , c (2(f)  for  which  U.  = 0 ,U.(q  ),  for 

v v=l  i v=l  i 

every  i = 0,...,p.  Setting  q*  = I ,2  "q  , q*  e T by 

v=l  v n 

the  uniform  convergence  of  the  series  on  the  right  and  the 

fact  that  each  q £ T . By  convexity  of  (2(f),  each 
V n (k  ) 

q e 12(f)  also  implies  q*  e (2(f).  If  q j e (2(f)  differs 
V Ck  ) 

from  (q*)  j at  some  x^  e l'^  (q*)  = U , then  Uj ((q  + q*)/2) 
is  a proper  subset  of  U.(q*),  a contradiction. 

As  a corollary  to  our  lemma,  it  is  clear  that  if  p*, 
q*  are  both  minimal  polynomials  of  best  approximation  to 
f,  then  Ui(p*)  = lX(q*)  and  p*(x)  = q*(x)  for  all 
x c U (p*)  = U^(q*),  for  each  i = 0,...,p.  Abusing  notation 
slightly,  let  IL(f)  E U^(q*),  q*  a minimal  polynomial 
of  best  approximation  to  f,  for  each  i = 0, . . . . Neces- 
sarily U.  4-  0 for  at  least  one  j £ {0,...,p}. 

LEMMA  2.  A function  q*  is  a minimal  polynomial  of  best 
approximation  to  f £ B^XT^  if  and  only  if  there  exists  no 
q £ T for  which 

maxfsup  [ (f  - q*)  ^ki^q  ^ki^  ] (x)  ;x  e U^(q*);k^  e F}  z.  0 
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and  such  that  the  strict  inequality 


[ (f  - q*) (kj  ^q(kj ) ] (x. ) £ 0 


holds  for  some  j e {0 p}  and  some  x.  c U.(q*). 

The  proof  of  this  Kolmogoroff  characterization  theorem 
is  standard  (e.g.,  [3;  p.  19]). 

In  particular,  if  q*  is  a minimal  polynomial  cf  best  a 
proximation  to  f then  I U . (q*)  | _?  n + 1,  where  U.(q*) 

denotes  the  number  of  points  in  U.(q*)»  for  otherwise  we 
may  find  a q e that  satisfies  the  interpolation  data 

q ^kj ^ (x  ) * -(f  - q*)^kj^(x  ),  for  all  points  x e U (p*), 

v v v j 

for  all  j = 0, . . . ,p . 

LEMMA  3.  If  f e B_\T  is  (k.  + l)-times  differentiable 
on  [— tt.tt  ] , then  p e fi(f),  x e U^(p)  implies 
(f  - p)  ^ki+^ ) (x)  = 0.  Moreover  k^  + 1 = k^+^  implies  that 
Lh  (p)  and  LL+^(p)  are  disjoint. 

(k  ) 

Proof . Assumption  x e U^(p)  implies  that  (f  - p)  i 
has  a maximum  or  a minimum  at  x,  hence  (f  - p)  ^ki+^  (x)  = 0 
If  ki+2.  = then  x e IK(p)  fl  U^+^(p)  implies 


A (f ) = 1 (f  - p)(ki+l}(x)  | = I (f  - p) (ki+1) (x)  ! = 0, 


a contradiction  since  f i T . 

n 

Set  G s {k^  c F; (f ) ? 0}  and  let  q = minfk^jk^  e G} 

THEOREM.  Suppose  f e B „\T  is  k + 1 times  differenti- 
— ■Li F n — p 

able.  Then  f has  a unique  best  approximate  in  F-norm 
whenever  q = 0,  while  0(f)  is  a convex  set  of  dimension 
one  if  q > 0.  Furthermore  the  derivative  of  any  best  ap- 
proximate in  F-norm  is  always  unique . 
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Proof . Let  p*  e ft(f)  be  a minimal  polynomial  of  best 
approximation  to  f.  Suppose  p e ft(f).  By  Lemma  1, 
p^i^(x)  = p*^Ki\x)  whenever  x e U (f),  for  each  k.  e G. 

(k  +1)  tk  +1)  1 

Lemma  3 then  implies  p i (x)  = p*  i (x)  whenever 

x e U (f),  k^  e G.  Setting  r = p - p*,  r e then  satis- 

fies the  constraints 


(1) 


(k.),  . (k.+l).  , 

r l (x)  = r i (x) 


0 


whenever  x e IL(f),  k^  e G.  By  Lemma  2,  the  IL(f),  k^  e G, 
must  contain  at  least  n + 1 points,  counting  the  same  point 
over  again  if  it  is  contained  in  another  U.(f).  By  Lemma  3, 
therefore,  the  constraints  (1)  number  at  least  2n  + 2,  since 


U1(f)  n U1+1(f)  = 0 if 


'i+1 


k.  + 1. 
i 


Thus  we  may  form  a 


(2n  = l)-incidence  matrix  E'  from  the  constraints  (1)  by 

choosing  an  x e U (f),  imposing  the  constraint  r^(x  ) = 0 

^ ^ (q4*l)  ^ 

(while  ignoring  the  constraint  l (x  ) = 0) , and  then 

^ (k  ) 

choosing  n other  pairs  of  constraints  r i (x  ) = 
r^i+^(x^)  = 0,  x e U.  (f) , kj  e G.  Form  E from  E'  by 
deleting  the  first  q columns  of  E' . The  resulting 
(2n  + l)-incidence  matrix  E is  strongly  conservative  and 
satisfies  the  weak  Pftlya  condition,  so  by  [2]  is  poised  with 

.(q) 


respect  to  T . Since 


s - r * e T satisfies  E,  s = 0, 
ii  n 

whence  r e must  be  a constant  trigonometric  polynomial. 
Moreover  if  q = 0,  r = 0,  and  p = p*. 

I t is  instructive  to  contrast  the  above  result  with  the 
analogous  algebraic  polynomial  result  of  R.  A.  Lorentz  [3]. 
In  the  latter  case  the  dimension  of  Q(f)  is  always  q, 
with  the  q'jdi  derivative  of  a best  approximate  necessarily 
being  unique.  In  the  trigonometric  case  things  are  more 
civilized:  the  dimension  of  £2(f)  is  at  most  one,  for  any 

value  of  q.  This  surprising  result  leads  one  to  question 
whether  a best  trignometric  approximation  in  F-norm  is 
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necessarily  unique — that  is,  can  q ever  be  nonzero? 
Example . Consider  f e C^(T)  defined  as  follows: 


f(x)  = 


( 


X + TT 

(-2/tt)x^  - x + tt/2 
(-10/tt)x^  - 5x 
-f(-x) 


, -7T  <_  X <_  — 7T  / 2 

, — TT  / 2 <_  X <_  TT  / 4 

, -it/ 4 <_  X <_  0 

, 0 <_  X <.  IT  . 


Considering  approximating  f'  from  = <sin  x,  cos  x> . 

Since  f'  e C(T)  is  an  even  function,  its  (unique)  best  ap- 
proximate in  the  usual  Chebyshev  norm  must  be  an  even  trigono- 
metric polynomial:  a cos  x,  for  some  real  a.  By  inspection 
a = -3  yields  the  best  approximate  to  f'  from  T^,  with 
deviation  2.  On  the  other  hand  q(x)  = -3  sin  x has  a 
maximum  deviation  of  (approximately)  3 - tt/2,  which  is 
strictly  less  than  2.  Thus  q = 1 in  this  example.  Note 
that  any  polynomial  of  the  form  p(x)  = -3  sin  x + c,  where 
| c J < e,  for  e * -1  + tt/2,  is  a best  approximate  to  f here 
in  the  F-norm,  where  F = {0,1}. 
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APPROXIMATION  BY  POLYNOMIALS  WITH  RESTRICTED  COEFFICIENTS 

D.  Leviatan 

The  purpose  of  this  paper  is  to  give  estimates  on  the  rate 
of  approximation  of  functions  by  means  of  polynomials  with 
restricted  coefficients.  Also  discussed  is  the  possibility  of 
approximating  monotone  functions  by  monotone  polynomials  of  the 
above  type  and  estimates  on  the  rate  of  approximation  of  this 
type  are  obtained. 


1 Introduction 

Let  A “ {A^}(k  £ 1)  be  a sequence  of  non-negative  con- 
stants and  set 

n 

PA  »{p:p(x)  = l a^  xk  n = 1,2,. ..JaJ  * A^} 

k»l 

Denote  by  C^tO.l]  the  space  of  continuous  functions  in  [0,1] 
that  vanish  at  the  origin.  Recently  Roulier  [6]  and 
v.  Golitschek  [2]  (see  also  [3])  have  shown  that  in  order  for 
P to  be  dense  in  C [0,1]  it  is  necessary  and  sufficient 
that  there  should  exist  a subsequence  {k^}  of  natural  numbers 
with  the  properties  that 


(1)  £ 1/k  =•  °°  and  lim  A^  » ® 

i-1  i—  ki 

Since  the  rate  of  approximation  of  functions  in  C^ [ 0 , 1 ] 

is  known  by  Jackson's  theorem,  it  is  natural  to  ask  how  much, 

if  any,  is  lost  in  the  rate  of  approximation  by  restricting  the 

polynomials  to  Pft  for  a prescribed  A.  In  view  of  this  Bak, 

v.  Golitschek  and  the  author  [1]  have  called  P.  efficient  if 

A 

the  rate  of  approximation  to  functions  in  CQ[0,1] 

Preceding  page  blank 
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by  polynomials  in  P is  still  described  by  Jackson's  estimate. 
This  does  not  mean  that  a particular  function  is  approximable 
at  the  same  rate  (see  Example  1) . 

We  will  review  some  of  the  results  in  [1]  in  section  2 

while  in  Section  3 we  will  discuss  monotone  approximation  with 

elements  in  P . . 

A 

2 Efficiency  of 

We  restrict  ourselves  here  to  the  case  where  A^  4 0 for 
all  k >.  1 namely  all  powers  are  allowed.  We  have  the  follow- 
ing sufficient  condition  for  efficiency  (see  [1]  Thm.  2). 

THEOREM  1 . Let  r be_  a non  negative  integer  and  6 > 0 . Let  A 
2 

satisfy  A,  >,  6 k , k >,  k . Then  there  exists  a constant 

L k O 

K ■ K(r,6)  Independent  of  A such  that  for  each  f e C [0,1] 
wl  th  f^^(0)*0  i * 0 , . . . , r and  any  sufficiently  1 arge  n 
there  is  an  n-th  degree  polynomial  p^  e PA  with 

(2)  ||f  - pj|  « Kn"r  u,(f(r);i). 


Here  and  in  the  sequel  w(f;.)  denotes  the  modulus  of  con- 


tinuity of  f.  This  condition  is  almost  necessary  in  the  sense 

- 0(k^  C)  for  some  e > 0,  then  P 
c A 

ent.  We  see  this  from  the  following  (see  [1]  Thm  6). 


that  if 


is  not  effici- 


THEOREM  2.  If_  Is  efficient  and  if  « 0(k^  G)  for  some 

e > 0,  then  there  exists  an  infinite  subsequence  k^  such  that 

* i " 1*2,...  . 

i i 


As  was  said  in  the  introduction  this  is  global  efficiency 
and  does  not  mean  that  a particular  function  is  approximable  at 
the  same  rate  as  obtained  with  the  set  of  all  polynomials.  This 
is  illustrated  in  the  following. 
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1/2 

EXAMPLE  1.  Let  f(x)  - x . Then  Jackson's  theorem  guarrantees 

that  the  rate  of  approximation  to  f by  the  set  of  all  poly- 

-1/2 

nomials  is  at  least  of  the  order  n . In  fact  there  exists 
an  n-th  degree  polynomial  pn  such  that 


(3)  | 1 x1/2  - pn(x) 


n 

* n 

k-l 


ill 

k + i 


£ exp  [-  l 1/k] 
k-l 


— 6/2 

If  we  restrict  the  coefficients  we  can  guarrantee  n for 
any  1 £ 6 < 2 provided  we  allow  the  coefficients  to  grow  like 
^2Bk/(2  B)^  name ly ^ * ^2B/(2  B) . Thus  Jackson's  estimate 

is  obtained  with  6-1  and  we  can  get  as  close  as  we  wish  to 
n * by  letting  6 approach  2.  However  it  can  be  shown  that 
(3)  cannot  be  achieved  for  any  B. 


3 Monotone  approximation  with 

Let  P+  and  P^  denote  the  sets  of  nonnegative  poly- 
nomials in  P and  the  set  of  nondcreasing  polynomials  in  P , 
A A 

respectively.  Then  the  generalized  Bernstein  polynomials  pro- 
vide a constuctive  proof  for  Weierstrass'  type  theorems  on 
approximation  with  P+  and  P^.  To  be  precise  we  have 

THEOREM  3.  The  set  P+  J_s  dense  in  the  set  of  nonnegative 

A 1 

functions  in  Cq  [0,1]  and  the  set  P^  is  dense  in  the  set  of 
monotone  nondecreasing  functions  in  Cq[0,1]  _if  and  only  if 
there  exists  a subsequence  {k^}  satisfying  (1). 

Proof . That  condition  (1)  is  necessary  follows  by  Roulier  [6] 
exactly  as  it  is  necessary  for  the  general  case.  As  for  the 
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sufficiency  suppose  (1)  holds.  Then  for  f e Co[0,l]  construct 
the  generalized  Bernstein  polynomial  of  the  sequence  {k^} 
associated  with  f (see  [5]).  Given  e > 0 then  for  n > N 


(4)  | |f  - Bn(f .-) | |<  e/2 


where  B (f,0  is  the  n-th  generalized  Bernstein  polynomial. 
Now 


(5)  Bn(f,x)  - ^ f(anj)  pnj(x) 


j-1 


where 


pnj(x)  ' ('1)n_:l  kn  1/a)nj(ki) 


and 


unj(x)  * (x  - kj)  ...  (x  - kn) 

anj  ’ [(1  ' kl/kj+l)  (1  " k!/kn)J1/kl 


Since  f(0)  ■ 0 there  exists  a 6 > 0 such  that  | f ( an j ) I < e/2 

provided  < 6.  Thus  if  we  put  m = m(n)  to  be  such  that 

a . < 6 £ a , it  follows  by  (4)  and  (5)  together  with 
n , m- 1 nm 

the  inequalities 


nj 


(x)  * 0,  l pnJ(x)  * 1 


J-l 


that 


| ! f (x)  - l f(a  ) p ,(x)|  | < e . 
j-m  J J 

It  has  been  shown  in  [3]  that  the  polynomial 


n 


(6)  p (x)  - [ f(a  ) p (x) 

belongs  to  P for  all  sufficiently  large  n.  Now  if  f is 
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nonnegative  in  [0,1],  then  so  is  pn . Suppose  then  that  f is 
nondecreasing.  Then  by  [4]  (2.2) 


Pni  M 


p„j(x)  - Vl  Pn,j-H(x) 


0 < x * 1. 


Hence  by  (6)  for  0 < x S 1 


p\x)  * x"1  l f(a  ) [X  pnj(x)  - Aj+1  pnj+1(x)l 
j“tn 

+ x *A  p (x)  f(a  ) 
mrnm  nm 


i 0. 

The  following  result  is  obtained  very  easily  from  the 
fact  that  (1)  is  necessary  and  sufficient  for  to  be  dense 

in  the  nonnegative  functions  in  Cq[0,1].  The  reason  we  proved 
the  second  part  of  Theorem  3 is  that  it  provides  a better 
estimate  on  the  rate  of  approximation  (see  Theorem  5) . 

00 

THEOREM  4.  The  set  of  polynomials  p in  PA  with  p * 0 
for  some  k * 1 _is  dense  in  the  functions  f in_  Co[0,l]  with 
f (0)  - ...  = f(k-1^(0)  = 0,  f(k)  * 0 if  and  only  if  (1)  holds, 

will  conclude  this  note  with  an  estimate  on  the  rate  of 
monotone  approximation  for  some  sequences  A. 

THEOREM  5.  Let_  A satisfy  A^  2 6k,  k ^ k^  for  some  6 > 0. 
Then  there  exists  a constant  K * K(6)  independent  of  A such 
that  for  each  nonnegative  or  nondecreasing  f e Cq[0,1]  and 
all  sufficiently  large  n there  is  an  n-th  degree  polynomial 
in  P^  o_r  P^,  respectively,  such  that 

||  f - pn|  | £ Kw(f,n  1^). 
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REGULARITY  OF  SOME  SPECIAL  BIRKHOFF  MATRICES 


G.  G.  Lorentz,  S.  S.  Stangler  and  K.  L.  Zeller 
1 Introduction 


We  shall  discuss  the  Birkhoff  interpolation  problem  for 

algebraic  polynomials  of  degree  _<  n,  for  a set  of  knots 

X:  x,  < x„  < . . . < x and  for  an  incidence  matrix 
12  m 

E - k*0'  Terminology  of  [6]  and  [3]  will  be  used 

without  explanation.  Our  main  problem  is  to  find  when  a given 
matrix  E or  a given  pair  E,X  is  regular  (poised)  or 
singular  (nonpoised,  or  poisonous) . 

After  an  example  in  [4]  of  a three  row  matrix  which  dis- 
proved the  "Atkinson-Sharma  conjecture,"  much  attention  has 
been  devoted  to  three  row  matrices  E of  the  following  type 


(1.1) 


ik 


1 if  i ■ li  0<^k<p;  i = 2,  k = k^, 
i *=  3,  0 <_  k < qj 

0 otherwise 


In  this  case  n * p + q + 1.  Without  loss  of  generality  we 
shall  assume  that  p <_  q,  that  E satisfies  the  Birkhoff  con- 
dition and  contains  two  supported  separated  ones:  0 < k^, 

k^  + 1 < < n.  If  these  inequalities  are  satisfied,  we 

write  E * (p,q;k1,k2).  Papers  [1]  and  [2]  describe  indepen- 
dent investigations  of  DeVore,  Meir  and  Sharma,  and  of  Lorentz 
and  Zeller  about  the  regularity  of  these  matrices.  We  shall 
"almost  solve"  the  problem  of  regularity: 


THEOREM  1.  A matrix  E ■ (p,q;k^,k2)  can  be  regular  only  if 
(1.2)  k^  + k2  ■ p + q + 1,  k2  > q. 
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or  if 


(1.3)  p i ^ < k2  i qs 

in  case  (1.2)  it  is  regular  if  and  only  if  p = q;  in  case 
(1.3)  it  may  be  regular  and  singular,  but  regularity  of 
(p,q;k1,k2)  implies  that  of  (p,q;kj,k')  if  (a) 
p _<  k|  <_  kj  < k2  <_  k^  £ q or  if  (b)  k|  = k^  + 1, 

k2  = k2  + i — q ■ 

Example  in  [4]  is  E = (2,2;1,4),  which  is  regular. 

Our  method,  based  on  properties  of  zeros  of  polynomials, 
applies  to  many  other  cases.  In  particular,  let  E be  an 
(m  - 1)  x (n  + 1)  Hermitian  matrix  with  n - 1 ones 


(1.4) 


'ik 


= 1,  0 £ k < r ^ , i = 1,  . . . , m-1,  Er^  = n - 1, 


and  let  X:  x1  < . . . < x . be  a fixed  set  of  knots.  In 
i m-1 

addition,  let  I be  a "free"  non-Hermitian  row  with  ones 
in  positions  k^  k2>  0 < k.^  k + 1 < k2  < n.  We  shall 

say  that  row  i of  E (or  the  knot  x^  is  a barrier  for  I, 

if  r ^ >_  k2  . 

Adding  I to  the  rows  of  E in  an  arbitrary  position, 
with  the  corresponding  knot  x,  we  obtain  a pair  E',  X'.  In 

this  process,  I is  allowed  to  coincide  with  one  of  the  rows 

i,  provided  i is  not  a barrier.  Then  two  new  ones  are  added 
to  row  i,  in  positions  k^  k?  if  k^  >_  r^  or  r^  k2, 
if  k^  < r^. 

The  following  polynomials  are  essential: 

m-1  r 

(1.5)  P(x,  A)  = (x  - A)  II  (x  - x.)  = (x  - A)Q(x) ; 

i=l 


here  A,  -<»  <_  A <_  +°°  is  a parameter.  If  A = +°°,  P reduces 
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i 


to  Q.  All  zeros  of  a derivative  P are  real.  They  are 

of  two  kinds:  f ixed  zeros  x^,  of  multiplicity  r^  - k, 

i = l,...,m-l,  and  the  simple  variable  zeros,  which  depend 
on  1 and  can  be  obtained  by  Rolle's  theorem.  We  shall  denote 
the  latter  by 


(1.6)  yjk)(A)  < y^k) (A) 


(A), 


(kx) 

their  number  by  z = z(k) . For  the  variable  zeros  of  P , 

(k  ) 

P we  write  simply  y! (A),  y” (A)  . 

We  say  that  the  matrix  E has  property  (R)  if  all  pairs 
E ' , X'  are  regular . 


THEOREM  2.  (i)  A necessary  condition  for  property  (R)  is 

that  E should  have  at  most  one  barrier.  (ii)  Rf  E has 

exactly  one  barrier  i then  the  necessary  and  sufficient 

condition  for  (R)  is  that  z(k^)  = z(k2)  = z and  that  for 

for  each  A i x^  , A / <»,  for  some  s,  0 <_  s <_  z 
o 


(1.7)  y j ( A) 


< yJU) 


ys(A) 


< x. 


ys+l(A) 


y’(A). 


(iii)  _If  E has  no  barriers,  the  necessary  and  sufficient 
condition  is  that  z(k^)  = z(k2)  + 1 = z and  for  each  A ^ °°, 


(1.8)  y | ( A ) < y'^(A)  < ...  < yVjU)  < yJ(A). 


It  should  be  noted  that  similar  results  have  been  given 

(k2-l)  (k2-l) 

by  Lorentz  [2]  in  terms  of  zeros  of  Q , Q , rather 

(k2)  (k2) 

than  P , P . The  difference  with  the  present  Theorem  2 
is  that  conditions  in  [2]  are  only  necessary;  in  contrast  to 
(1.7)  and  (1.8)  they  do  not  contain  A;  they  apply  to  a slight- 
ly more  general  class  of  matrices;  finally,  if  they  are  not 
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satisfied,  apparently  strong  singularity  of  E follows.  It 
would  be  desirable  to  combine  the  two  different  approaches. 

2 Some  identities 

We  shall  use  some  identities  valid  for  the  polynomial 

(2.1)  Q(x)  = (x  + l)P(x  - l)q,  0 <_  p £ q. 

LEMMA  1.  For  integers  p + q = k + £,  0<^k<^£,  the  function 
Q(x)  satisfies  the  symmetry  relation 

(2.2)  |y  Q(£)(-x)  = (-l)k(x  + - l)p-£  -£T  Q(k)  (x) , 

x # +1. 


Proof . First  assume  that  k £ p £ q <^  £.  Then  by  means  of 
Leibniz'  formula  we  obtain 


1_ 

k! 


E (k)il  (P)(x  + 1)P  i(k  - i)! 


(k^)(x  - l)q‘k+i 

(2’3)  k 

= Z (P)(  q )(x+l)p-i(x-  l)q"k+i 
i=0  1 k 1 

- (x  + l)£_q(x  - 1)£_P 

k . . 

^ (JxAKx  + 1)  (x  - l)1, 
i=0 


because  p - k = £ - q,  q - k = £ - p . The  similar  sum  for 
the  derivative  Q has  zero  terms  for  i < £ - q and 
for  i > p.  Omitting  them,  we  get 
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(2.4)  jtQ(£)(x)  = E (±)  (^-1.) (x  + l)P~i(x  - l)q"'e+i. 

i=£-q 

Making  substitution  i = p - j,  we  derive 

JT  Q(^}  (x)  = E (?)(  q )(x  + l)j(x  - l)k_j 
3=0  2 k_J 

and  therefore 

(2.5)  Q(£)(-x)  = (-l)k  E (?)(.  q.)(x  - l)j(x  + l)k"j. 

Cm  j=0  2 : 

Comparing  this  with  (2.3),  we  obtain  the  required  formula. 

The  case  p<^k^£<_q  is  very  similar.  Here,  however, 
the  sum  in  (2.3)  is  only  for  the  range  0 <_  i <_  p;  the  sum  in 
(2.4)  is  for  the  same  range,  as  well  as  the  sum  in  (2.5)  after 
the  substitution  j = p - i. 

(k) 

It  follows  from  (2.2)  that  the  sets  of  zeros  of  Q 

(£) 

and  Q in  (-1,+1)  are  symmetric  to  each  other  with  respect 
to  the  origin. 

COROLLARY  1.  If  k + Z = 2p,  the  derivatives  of  Q(x)  = 

(x^  - I)*3  satisfy  the  relation 


(2.6)  jr  Q(^} (x)  = (x2  - l)p  1 p-  Q(k)(x). 


This  follows  from  (2.2)  if  one  observes  that  the  deriva- 

(£) 

tive  Q is  even  fc>r  odd)  if  the  integer  t is  even  (or 

odd).  (See  also  [3]). 

3 Properties  of  polynomials 

We  shall  need  the  following  well-known  form  of  Rolle's 
theorem,  compare  [4],  [7]. 

THEOREM  3.  For  each  n there  is  a 6,  0<6<l/2  with 
the  following  property.  If  a < 6,  3 - a >_  d,  are  two 


427 


G.  G.  LORENTZ  et  al. 


adjacent  real  zeros  of  a polynomial  P^  of  degree  n,  then 

has  a zero  in  (a  + 6d,8-6d). 

In  particular,  if  all  zeros  of  Pr  are  real,  then  the 

intervals  (a,  a + 6d) , (6  - 6d,  6)  contain  no  zeros  of  P' . 

n 

We  shall  consider  polynomials  P^(x, X)  which  depend 
continuously  on  a parameter  A.  If  all  zeros  of  P^  are 
real,  we  write  them  as  a sequence 


y-j_(X)  < y2(X)  < . . . < yn(X) . 


e. 


It  is  easy  to  prove  that  each  y.(A)  depends  continuously 

1 00 

upon  X.  The  same  is  true  for  the  zeros  of  P (x,A). 

m r . 

THEOREM  4.  Let  P(x)  = II  (x  - x.)  ^ , n _>  2,  m,  r.  >_  1, 

j=l  J J 

x^  < ...  < x^  be  a polynomial  of  degree  n with  real  zeros. 

(i)  The  zeros  of  the  derivative  P'  depend  in  a monotone 

fashion  upon  the  x^ : they  increase  if  all  x increas 

(ii)  If  the  multiplicity  r of  one  of  the  zeros  x^.  = a 
increases . while  the  remaining  multiplicities  do  not  change, 
then  the  x ^ move  away  from  a . 

Proof . (i)  The  derivative  P'  has  two  kinds  of  zeros: 
zeros  Xj  of  multiplicity  r^  - 1,  and  Rolle  zeros, 

X j ’ Xj  < xj  < Xj+1’  °r  zeros 


(3.1)  R(x)  = 


P’(x) 

P(x) 


m r 
I J — 

j-1  X ' "J 


The  function  R is  decreasing  between  its  poles  x ^ . We 
can  assume  that  only  one  of  the  zeros  of  R,  say  a = x j , of 
multiplicity  r,  moves  to  the  right,  to  the  position  a'  > a. 
This  increase  of  a will  make  the  term  r/(x  - a)  of  (3.1) 
larger  both  for  x < a and  for  x > a',  in  particular  around 
any  of  xj . Consequently  xj  will  move  to  the  right. 
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(ii)  Similarly,  enlargement  of  r will  make  the  term 
r/ (x  - a)  of  (3.1)  larger  if  x > a,  smaller,  if  x < a. 
Consequently,  a zero  x^  will  move  right  if  xl^  > a,  and  move 
left  if  xj  < a. 

COROLLARY  2 . Let 

P . 1 

Pi(x)  = (x  + 1)  i(x  - 1)  i,  i = 1,  2,  k <_  £ P2,  q1  >_  q^. 

The  Rolle  zeros  xj , x^.'  of  P^k) , P^  in  (-1,+1) 

satisfy  the  inequalities 


(3.2)  x!  < x"  j = 1,  ...,  k. 

J - j 

Proof.  There  are  k Rolle  zeros  of  the  derivatives,  and  they 
lie  in  (-1,+1).  We  first  consider  Pj,  P^,  with  one  zero 
x|,  x^  each.  We  can  obtain  P2  from  P^  by  increasing  p^ 
and  by  decreasing  q^.  Each  of  these  operations  moves  x| 
to  the  right.  This  proves  our  statement  for  the  first  deriva- 
tives. After  this,  we  apply  the  same  argument  to  Rolle  zeros 
of 

p^-1  q^1 

Pj  = (x  + 1)  (x  - X]_)(x  - 1)  , i = 1,  2 


and  so  on. 


COROLLARY  3.  Let  P(x) 


k Rolle  zeros  x.  of 
J — 


= (x  + l)p(x  - l)q,  k <_  p <_  q . 

Ck") 

PV  j-n  (-1,+1)  satisfy 


Then  the 


(3.3) 


Xj  - Xk+l-j’ 


3 = 1, 


k. 


Proof . In  Corollary 
= +P(-x).  The  zeros 
we  obtain  (3.3)  from 


2 we  take  P^ 
of  P2  are 
(3.2). 


P, 


P2(x)  = 

“Xk-1 


(x  + l)q(x  - l)p 
. . . < -x^,  and 
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We  need  also  a result  reminiscent  of  Markov's  theorem. 

For  a similar  result  and  proof  see  [5]. 

THEOREM  5.  Let  all  zeros  of  the  polynomials  L,  M be  real: 
simple  zeros  x^,  i =1,  n and  zero  0 of  multiplicity  r 

for  L,  simple  zeros  y^,  i=l,  n and  zero  0 of  multi- 

plicity s,  0 <_  s < r for  M.  If  the  zeros  alternate  in  the 
sense  that  for  some  p,  0 p < n. 


(3.4)  x^  <_  < 


< y < 0 
- P 


< y 


.,  < x < 

p+1  — p+1  — 


IV 


then  the  zeros  £^»  of  the  derivatives  L',  M'  alternate 

strictly: 


(3.5)  £.  < n,  < ...  < n <0<n.1<  ...<£. 

11  p p+1  n 

Proof.  It  is  sufficient  to  consider  the  inequalities  (3.5)  in 
(-°°»0].  Assume  first  that  for  some  j < p,  y^  < x,+^.  We 
show  that  for  x e I = (y^ , x j-j-i  ^ we  have  *(x)  < b(x)»  where 


L'  ^ 

r(x) 

n 

_ V 

l 

— L 

i=lX 

- 

n 

v 

i . 

s 

L 

i=lX 

T 

- yi 

x’ 

M 


Indeed,  on  this  interval  we  have 


x - x±  1 x - yt.  i = 1,  ...»  p;  X > X - yp+1; 

x - xi  1 x - yi+1,  i = p + 1,  ...»  n - 1. 

Right  and  left  sides  of  each  of  the  inequalities  have  the 

same  sign  on  I,  hence  l/(x  - x±)  < 1/ (x  - y^ , i = 1 

1/x  < l/(x  - yp_|_1) ; 1/  (x  - x±)  <_  1/(1  - y1+1),  i - p+1,  ...,  n-1. 
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moreover  1/x  <0,  1/ (x  - x ) < 0.  Adding,  we  obtain 

n 

A(x)  < p(x)  on  I.  The  same  argument  yields  A(x)  < y(x) 

for  x e (y  ,0). 

P 

Selecting  j <_  p,  we  now  prove  that  < 0j  • We  can 

assume  that  y^  < (otherwise  ^ — yj  < n j ^ ’ anc*  t^iat 
< Xj+1  (°therwise  < xj+1  £ hj)»  hence  that  £ ^ , 

Hj  e 1 = (yj  * xj+i> ; for  J = p we  have  Cp»  np  t 1 * (yp.o). 

Then  n are  the  only  zeros  of  y,  A on  I.  Since  y(x) 

decreases  on  I,  it  is  positive  on  (yj,nj)*  hence  the  zero 

of  A satisfies  < n j • The  inequality  Rj  < (j  < P) 

follows  from  < Yj  1 x L < Cj+1* 

4 Proof  of  theorem  2 


We  have  to  study  the  dependence  on  A of  the  zeros  of 

the  derivatives  of  polynomials  (1.5).  The  number  z = z(k)  of 

00 

variable  zeros  (1.6)  of  P (x,A)  does  not  depend  on  the 

position  of  A for  A # ®.  This  number  can  be  easily  computed 
if  the  r^  are  given;  we  have  z(k)  > 0 for  0 <_  k < n.  The 
00 

zeros  y (A)  depend  on  A continuously,  and  by  Theorem  4(i), 
strictly  increase  with  A. 

Let  r >_  k,  then  from  Rolle's  theorem  it  follows  that 
1 (k) 

there  is  a zero  y^  in  the  interval  (x^,A)  for  A > x^;  if 

A is  close  to  Xj,  there  is  only  one  such  zero,  by  Theorem  3. 

As  A moves  to  the  left  across  x^ , so  does  the  zero,  and  A 

overtakes  the  zero.  If  A < x. , there  may  be  at  most  one  zero 

00  1 

y^  < xi ' * is  cl°se  to  -00>  such  a zero  must  exist  and 

be  also  close  to  -<*>,  by  Theorem  3.  Similarly  for  A close 

(k) 

to  +°o.  Let  < ...  < be  the  Rolle  zeros  of  Q 

If  A is  close  to  -°°,  the  zeros  (1.6)  approach  the  points 

q^ ^z-1’  ^or  * close  to  -H»  they  approximate 

q ^ , . • • » q z_ ^ » +<x>-  Thus,  as  A moves  from  +°°  to  -00  and  then 

across  -»  again  to  +°°,  the  zeros  (1.6)  perform  a cyclic 

permutation,  y,  being  replaced  by  y,  and  y.  by  y 
J J”1  1 z 
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(kL)  (k2) 

If  there  are  two  derivatives  P , P we  have  to 

consider  zeros  y!,  yV  of  both.  In  this  case,  for  a barrier 
J J 

x^,  RoLIe's  theorem  and  Theorem  3 reveal  that  in  each  small 
interval  (x^,A),  A > x^,  there  is  exactly  one  pair  y] , y1.' 


with  the 


property  x,  < y"  < y!  < A;  similarly  for  (A,x.). 

1 j J 1 


Also,  for  A close  to  -°°,  there  are  zeros  -<*>  < yj  < y^', 
both  close  to  and  similarly  for  A close  to  +■». 

The  P(x,A)  of  (1.5)  represent  precisely  all  nontrivial 
polynomials  annihilated  by  E,X.  Let  X'  = X m {x)  with 
an  arbitrary  knot  x,  and  let  E'  be  the  corresponding  matrix 
with  (n  + 1)  entries  one.  We  have 


(k  ) (k  ) 

(4.1)  P (x, A ) = P (x, A)  = 0, 


precisely 
if  x is 

kl  * V 

as  a zero 
produces  a 


when  P is  annihilated  by 

not  one  of  the  knots  x^, 

If  kj  < r^,  x = x.  and 

y = y^(A)  approaches 
J (r^ 

zero  of  P between  x^ 


E',X'.  This  is  obvious 
also  when  x = x^  with 
is  not  a barrier,  then 
x^,  Rolle's  theorem 

and  y,  and  we  must  have 


(r  ) (k  ) 

(4.2)  P (x,\)  P (x,\)  = 0, 

which  is  then  equivalent  to  (4.1).  This  proves  that  matrix  E 
has  property  (R)  exactly  when  (4.1)  cannot  happen  for  - » < x , 

k < + 00  . 

In  case  (i)  of  Theorem  2,  there  are  two  barriers  i^,  i^ 

with  r , r > k„.  If  \ i (x.  ,x  ),  this  interval  contains 
ll  l2  11  z 

exactly  k^  zeros  y'  and  k^  zeros  y";  for  k in  the  interval, 
these  numbers  increase  by  one.  We  perform  several  translations 
of  k from  a position  k ^ close  to  +«  to  the  left,  and 
over  - oo  back  to  . At  each  step,  a pair  ordered 
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y'  < y"  < x enters  the  interval  from  the  right,  and  another 
i2 

pair  x < y"  < y'  leaves  it  at  left.  Since  k - k.  _>  2, 

^ 1 i ^ 

there  are  two  zeros  y"  contained  between  two  adjacent  y'. 

After  a certain  number  of  steps  we  obtain  a contradiction 
to  this  fact,  if  the  zeros  y',  y"  never  cross,  that  is,  if 

(4.1)  is  impossible.  This  shows  that  (R)  must  be  violated. 

In  cases  (ii)  and  (iii)  we  also  obtain  a contradiction  if 

the  distribution  (1.7)  or  (1.8)  is  violated,  and  (4.1)  never 
happens.  For  this  purpose  we  translate  A several  times  from 
+co  to  -oo  (or  from  -°°  to  +<»)  and  watch  the  pair  y',  y" 
close  to  -oo  (or  to  +oo)  . This  completes  the  proof. 

We  note  that  if  E contains  a row  i,  1 < i < m - 1,  \ 

for  which  k^  < r^,  r^  + 1 < then  E does  not  have  the 

property  (R) . Indeed,  the  matrix  E'  is  singular  for 
x = x^  by  a result  in  [4]. 

If  E has  only  two  rows,  then  by  theorems  of  P61ya 
and  Atkinson-Sharma,  (4.1)  or  (4.2)  can  happen  only  for 
x^  < x < x„.  Thus  we  get: 

COROLLARY  4 . Conditions  of  Theorem  2 are  necessary  and  suf- 
ficient for  the  regularity  of  the  matrix  E = (p,q;k^,k2). 

5 Proof  of  theorem  1 

The  three  row  matrix  E *=  (p,q;k^,k2)  we  treat  by  means 
of  Corollary  4.  We  can  assume  x^  = -1,  X2  = +1,  and  have 
to  consider  the  polynomials 

(5.1)  P (x,  A ) = (x  - A)  (x  + l)!,(x  - l)q. 

Rolle's  theorem  easily  gives  the  number  of  variable 
(Rolle)  zeros  of  P^^: 
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fk  + 1 


if  0 <_  k p 


(5.2)  z(k) 


+ 1 


if  p £ k < q 


p + q + l-  k ifq<_k<_p  + q + l. 


We  have: 

LEMMA  2.  The  matrix  E = (p,q;kj,k2)  is  singular  if 
k^  + k2  = p + q + 1,  k^  £ p,  k2  > q and  p < q. 

This  answers  a question  of  DeVore,  Meir  and  Sharma  [1], 

Proof . Assuming  x = -1,  x„  = 1,  we  consider  the  zeros  y', 

(kxr  (k2) 

y"  of  the  derivatives  P , P of  the  polynomial  (5.1), 

when  < X < 1.  We  have  z(k^)  = k^  + 1,  z(k2>  = k^. 

First  let  A = -1.  Then  yj(-l)  = -1.  Here  P = 

(x  + l)P+1(x  - 1)^.  Since  k1  + V.j  = (p  + 1)  + q,  Lemma  1 

(k1>1  (k  ) 

applies:  the  zeros  of  P , P inside  (-1,+1)  are 

symmetric  about  0.  In  particular,  y^(-l)  = -y^  +^(-l). 

Since  p + 1 <_  q,  we  can  use  (3.3)  from  Corollary  3: 
y2(-l)  1 +1(-1).  Together  this  gives  y2(-l)  £yj(-l). 

On  the  other  hand,  considerations  of  the  first  part  of 
section  4 show  that  for  A close  to  -°°,  y'^(A)  < -1  <_  y2(A). 
Hence  for  some  A,  y ” ( A ) =*  y^(A),  and  E is  singular. 

We  consider  different  cases  of  Theorem  2. 

Case  (i) : two  barriers:  k2  <_  p.  Here  E is  singular. 

Case  ( iii) : no  barriers:  k2  > q.  As  a necessary  condition 

of  regularity  we  have  z(kj)  = z(k2)  + 1 = z.  It  follows 
from  (5.2)  that  this  can  happen  only  if  both  z = k^  + 1 for 
kj  <_  p and  z = p + q + 2-k2  for  k2  > q,  hence  if  kj  + k2 
= p + q + 1.  Lemma  2 gives  singularity  of  E if  p < q. 

On  the  other  hand,  both  in  [1]  and  [3)  it  is  shown  that  E 
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is  regular  if  p = q. 

Case  (ii):  one  barrier:  p < q.  Since  z(k^)  = p + 1 

in  our  range,  the  necessary  condition  z(k^)  = z(k2)  gives 
that  p <_  k^  < q also,  hence  P i ^ < k2  i q.  Here  E can 
be  regular  and  singular.  In  fact,  DeVore,  Meir  and  Sharma 
[1]  gave  simple  necessary  and  sufficient  conditions  for  the 
regularity  of  E when  p = 1. 

To  prove  (a)  of  Theorem  1,  we  note  that  by  (5.2),  for 
p k < l <_  q,  z(k)  = z (SL)  = p + 1,  and  by  Rolle's  theorem, 

(5.3)  yjk)(A)  < yj  £)  (A) , 

for  the  zeros  left  of  +1.  Condition  (1.7)  reads  here 

yJU)  < y^'(A)  < ...  < y^(A)  < 1 < y^U)  < y^U), 

(5.4)  1 < A < +«> 

y|(A)  < y” ( A ) < ...  < y^+1  (A)  < 1,  -°°  < A < 1. 

This  order  can  be  disturbed  only  on  the  left  of  1;  this 
happens  precisely  when  for  some  A and  i < j, 

(5.5)  y”(X)  < y|(A). 

If  this  holds  for  the  pair  k',  k^,  k^  <_  k^  < k2  <_  k^, 
then  by  (5.3)  also  for  the  pair  kp  k2.  The  inequality 

(5.5)  is  reversed  for  A close  to  -<*>.  Hence  (p,q;k^,k2) 
is  singular  whenever  (p,q;k|,k2)  is. 

Statement  (b)  of  Theorem  1 follows  from  Theorem  5. 
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MUNTZ- SZASZ  TYPE  APPROXIMATION  RESULTS 
AND  THE  PALEY-WIENER  THEOREM 
W.  A.  J.  Luxemburg 
1 The  M'untz-Szasz  theorem 

S.  N.  Bernstein,  inspired  by  the  Weierstrass  approxima- 
tion theorem,  raised  in  his  prize-winning  essay  [1]  of  1912 
the  following  question:  Given  an  increasing  sequence 

A = {Xq  = 0 < A^  < < • • • } of  nonnegative  real  numbers 

what  are  the  necessary  and  sufficient  conditions  in  order  that 

An 

the  span  of  the  powers  t (n  ^ 0)  is  dense  in  C[0,1]?  He 

showed  that  the  condition:  Z ( ( 1 + logA  )/A  : A > 0)  = 00 

n n n 

is  necessary  and  that  the  condition:  "The  sequence  A is 

bounded"  is  sufficient.  Bernstein  conjectured,  however,  that 

A j-j 

the  span  of  the  powers  t (n  ^ 0)  is  dense  in  C[0,1]  if  and 

only  if  Z ( 1 / A : n > 1)  = °°. 

n — 

In  1914  Ch . Mdntz  [12]  verified  the  conjecture.  He  used 

the  at  that  time  newly  developed  method  of  the  Gram  determi- 

2 

nants  to  obtain  formulas  for  the  distance  in  L [0,1]  of  the 

A 

function  t to  the  span  of  the  functions  t (0  <_  k <_  n)  . 

The  Gram  determinants  which  arise  in  this  manner  are  of  the 

form  det(l/(a^  + a^  + l)),i,  j = 0,l,2,...,n,  which  are  known 

as  Cauchy  determinants  and  can  be  evaluated  explicitly.  This 

led  to  the  important  result  that  the  distance  d(A)(A  > 0)  in 
2 A 

L [0,1]  of  the  function  t to  the  linear  span  of  the  powers 
A 

t (n  >_  0)  can  be  expressed  by  the  formula 

(1.1)  d(A)  = /2A  + L n (1  - - + x + y) • 

n=0  n 
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Hence,  it  follows  from  the  Weierstrass  approximation  theorem 

that  the  span  of  the  powers  t n(n  0)  is  dense  in  C[0,1]  if 

and  only  if  d(n)  = 0 for  all  n = 1,2,....  This  is  obviously 

the  case  if  and  only  if  Z(l/A  • ^ > 0)  = Q°.  Thus  proving  the 

2 n n 

conjecture  for  the  L -metric.  Using  Fej^r's  theorem  concern- 
ing the  Cesaro  summability  of  the  Fourier  series  of  continuous 

2 

functions  Muntz  was  able  to  show  that  the  L -result  implies  the 
same  result  for  C[0,1],  and  so  settled  the  Bernstein  conjec- 
ture . 

In  1916  0.  Szdsz  [15]  extended  the  result  to  certain  se- 
quences of  complex  numbers  and  at  the  same  time  was  able  to 

simplify  that  part  of  Muntz'  proof  in  which  he  showed  that  the 
2 

L -result  implies  the  C [0, 1 ]-result  by  simply  observing  that 

A,  t -1  Xi~l 

(1.2)  | t°  - la.t  | = (n  + 1)  |/  (xn  - Za'x  )dx| 

0 

(n  + 1)(J  |xn  1 - Za’x  1 |2dx>y2 

0 

There  is  still  another  way  of  approaching  the  Mlintz-Szasz 

theorem.  Observing  that  the  linear  span  of  the  powers 
A o 

t n(n  >_  0)  is  dense  in,  say,  LZ[0,1]  if  and  only  if 

f z L2[0,1]  and 
1 A 

J t nf(t)dt  = 0 
0 

for  all  n 0 implies  f = 0,  we  discover  that  the  Mlintz- 
Szasz  theorem  is  equivalent  to  the  problem  of  characterizing 
the  set  of  uniqueness  of  functions  analytic  in  Re(z)  >0  of 
the  form 

1 

/ tZf(t)dt. 

0 
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Under  the  substitution  t = exp(-x)  the  integral 
1 

/ tZf(t)dt 
0 

transforms  into  an  integral  of  the  form 

oo 

(1.3)  F( z)  = / f (exp(-x)) -exp(-x/2) ‘exp(-x(z  + -^))dx, 

0 

with  g(x)  = f (exp(-x) ) -exp(-x/2)  e L^(0,«>)  and  Re(z)  > -1/2. 

**  n 

We  conclude  that  the  question  whether  the  powers  t (n  >_  0) 

2 

span  L [0,1]  is  equivalent  to  the  question  whether  the  set 

A = {X'.n>0)  of  real  numbers  is  a set  of  uniqueness  of  an- 
n — 

alytic  functions  defined  by  Laplace  integrals  of  the  form 

OO 

(1.4)  F(z)  = j f(t)e  Ztdt,  f e L^(0,«)  and  Re(z)  > 0. 

0 

The  basic  idea  of  this  method  goes  back  to  Szdsz  [15] 
and  has  been  used  by  various  authors  such  as  Carleman  [2], 

Szeg<J  [16],  Paley-Wiener  [13],  p.  32-34  and  notably  L.  Schwartz 
in  his  thesis  [14  ] . 

To  answer  the  question  about  what  sets  A are  sets  of 

uniqueness  of  (1.4)  Schwartz  used  the  following  theorem  of 

Blaschke.  Let  D be  a domain  in  the  complex  plane  and  let 

{z  :n  > 0}  be  a sequence  of  points  in  D.  Then  there  exists 
n — 

a nonzero  bounded  analytic  function  in  D which  vanishes  on 
the  points  z^(n  1 0)  if  and  only  if  Z(G(a,z  ):  z^  / a)  < 
where  G is  the  Green's  function  of  D and  a is  an  arbitrary 
point  of  D. 

For  a domain  of  the  type  Re(z)  ^ e,  e > 0 the  Blaschke 

condition  is  equivalent  to  lim  X =0  and  £(1/X  :X  > e)  < *>. 
M n n n 

Since  for  every  e > 0 and  analytic  function  of  the  type  (1.4) 
is  bounded  in  Re(z)  ^ e it  follows  immediately  that  the 
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condition  Z(l/A  :n  > 0)  = 00  is  sufficient  for  A to  be  a 
n 

set  of  uniqueness.  To  prove  by  this  method  that  the  condition 
is  also  necessary  one  has  only  to  observe  that  if 
E(l/\n’.n  > 0)  < OT,  then  the  function 


(1  + z) 


-2 


n ( (Ak  - z)/ (Ak  + z)) 
n>0 


can  be  written  in  the  form  (1.4)  with  f ^ 0 and  f e L^(0,1) 
for  all  p >_  1 . 

Because  of  the  elementary  nature  of  the  Gram  determinant 
method  the  M\intz-S2asz  theorem  for  the  unit  interval  is  treated 
in  most  textbooks  on  approximation  theory.  If  instead  of  the 
unit  interval  we  consider  an  interval  of  the  form  [a,b], 
a > 0 the  analysis  of  the  Bernstein  problem  becomes  consi- 
derably more  complicated.  Of  course,  the  condition 

E(l/A  ) = » is  still  sufficient  for  the  span  of  the  powers 
A n 

{t  } to  be  dense  in  L^[a,b](p  >_  1)  and  C[a,b].  But  the 
fact  that  the  condition  is  also  necessary  in  this  case  turns 
out  to  be  far  from  trivial.  The  main  reason  for  this  is  the 
fact  that  for  an  interval  [a,b],  a > 0 the  Gram  determinants 
can  no  longer  be  expressed  in  simple  terms.  Furthermore,  in 
this  case  the  complex  variable  method  described  above  leads 
to  another  uniqueness  problem. 

For  special  sequences  of  integers  the  first  proof  that  the 
conditions  is  also  necessary  in  this  case  Was  presented  in 
1943  in  a paper  by  Clarkson  and  Erdds  [3].  They  did  not  use 
the  two  methods  described  above  directly  but  rather  reduced 
this  case  by  an  ingeneous  indirect  argument  to  the  case  of  the 
unit  interval.  This  argument  was  refined  later  in  a paper  by 
J.  Korevaar  [8]  to  include  more  general  sequences.  Also  L. 
Schwartz  in  his  thesis  [14]  gave  a complete  treatment  of  the 
problem  of  approximating  certain  functions  by  sums  of  expo entials . 
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But  also  his  treatment  of  the  problems  for  arbitrary  intervals 
was  indirect.  This  somewhat  unsatisfactory  state  of  affairs 
may  perhaps  explain  why  the  Mlintz-SzAsz  theorem  for  arbitrary 
intervals  has  not  yet  found  its  way  into  the  textbooks. 

Recently,  however,  W.  A.  J.  Luxemburg  and  J.  Korevaar  [10], 
have  shown  that  a direct  approach  via  the  complex  variable 
method  is  very  well  possible.  We  shall  now  discuss  this  ap- 
proach briefly.  For  a similar  but  somewhat  different  method 
we  refer  the  reader  to  W.  Frost  [6]. 

For  the  sake  of  simplicity  we  shall  assume  that  A = 

{A^:n  0}  is  an  increasing  sequence  of  positive  real  num- 

bers tending  to  infinity.  The  method,  however,  works  equally 
well  for  certain  sequences  of  complex  numbers.  We  wish  to 
prove  the  following  theorem. 

THEOREM  1 . (J.  Clarkson-P.  Erdos,  L.  Schwartz,  J.  Korevaar) . 

An  rj 

In  order  that  the  powers  t (n  >_  0)  span  L^[a,b],  p > 1 or 
C [ a , b ] , a > 0 it  is  necessary  and  sufficient  that 

I (1/A  ) = «». 

n 

Proof . Since  the  sufficiency  of  the  condition  is  an  immediate 

consequence  from  the  classical  Mlintz-Szcisz  theorem  we  will  have 

only  to  show  that  the  condition  is  necessary.  To  this  end, 

An 

we  have  to  show  that  if  Z (1/ A ) < °°  the  functions  t 

n 

(n  >_  0)  do  not  span  the  spaces  L^{a,b},  p >_  1 C[a,b]  a > 0. 

A 

By  duality,  this  means  that  for  instance  the  functions  t 

2 

(n  >_ 0)  do  not  span,  say,  L [a,b]  if  and  only  if  there  exists 

2 

a nonzero  function  f t L [a,b]  such  that 
b A 

/ t nf(t)dt  = 0 for  all  n. 
a 

This  fact  suggests  that  we  consider  the  function 
b 

F (z)  = / tZf(t)dt, 
a 
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f e L [a,b]  and  z complex.  Under  the  substitution  t = be 
the  integral  transforms  into 

..  0 

, Z+l  r ..  -u.  -U  -ZU, 

b J f(be  )e  e du, 

0 

where  0 = log(b/a)  < °°,  and  f(be  u)e  u e L^[O,0].  If  we 
replace  z by  iz,  then  we  recognize  that  we  are  dealing  with 
the  Fourier  transforms  of  a function  of  compact  support,  which 
can  be  conveniently  written  in  the  form 

6 2 

(1.5)  G(z)  = j g(t)  exp(+izt)dt,  g e L [0,0]  and  z complex. 

G is  an  entire  function  of  exponential  type  <_  0,  which  is 
bounded  on  the  real  axis.  In  addition,  it  follows  from  the 
celebrated  Paley-Wiener  theorem  [13]  that 

| | G(x)  | dx  < 00  . 

— 00 

Furthermore,  G(iA  ) = 0 for  all  n = 0,1,2,....  This  brief 
n 

deduction  shows  that  in  order  to  prove  that  the  condition  of 
Theorem  1 is  also  necessary  we  need  to  construct  an  entire 
function  of  the  form  (1.5)  which  is  not  identically  equal  to 
zero  and  which  vanishes  at  the  points  iXR  (n  = 0,1,2,...), 
provided  £(1/Xn)  < °°.  The  latter  hypothesis  suggests  to 
consider  first  the  function 

F(z)  = n (l  - 77-) , 
n>0  n 

which  G has  to  contain  as  a factor.  Now  F is  of  exponential 
type  but  of  type  zero  and  can  not  be  written  in  the  form  (1.5). 
Furthermore  for  z = x is  real,  | G(x)  | = n(l  + x^/X^)^^ 

which  tends  to  infinity  as  |x|  -►  «.  It  is  not  difficult  to 
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show,  however,  that 

oo 

/ (log |G(x) | /x2)dx  = (1/A  ) < °°. 

0 n 

We  can  now  use  the  following  fundamental  result.  For  the 
proof  we  refer  to  [10]. 

THEOREM  2.  Let  A = {An:n  >_  0}  be  an  increasing  sequence  of 
positive  numbers  and  let  m = oj(x),  x ^ 0 be  a positive  in- 
creasing function.  Then  for  each  t > 0 there  exists  an 
entire  function  F of  exponential  type  t satisfying  the 
conditions ; 

(1.6)  F(iA  ) = 0 for  all  n = 0,1,2,... 
n 

and  | F ( x)  | <_  exp(-w(  | x | ) , x real  if  and  only  if 

Z (1/X  ) < °° 
n 


and 

oo 

2 

J io(x)/x  dx  < » . 

1 

In  fact,  if  the  above  conditions  are  satisfied,  then  we 

can  construct  a decreasing  sequence  {e^n  L D of  positive 

real  numbers  such  that  Ee  = t and 
n 

F(z)  = n (1  - TT- ) * n cos(e  z) 
n>0  n n>l  n 

satisfies  the  required  conditions. 

Returning  now  to  the  situation  at  hand  we  observe  that 
we  can  apply  Theorem  2 to  the  sequence  A = (Xn)  and  to  the 
increasing  function  m(x)  = log|G(x)|  + /|x | and  take  for  t 
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the  constant  3/2.  Then  we  obtain  a function  F(z)  = 

n(l  - z/iA  ) II  cos(e  z)  which  is  entire  and  of  exponential 
n n>l  n 

type  3/2  and  which  on  the  real  x-axis  satisfies  the  condition 

|F(x)  | <_  exp(-/|  x| ) . Hence,  \ 

■+~  9 1 
/ |F(x)  | dx  < °°  , 1 

—CO 

and  so,  by  the  Paley-Wiener  theorem  there  exists  a nonzero 

2 1 
function  f e L [0,3]  such  that  I 

1 

F (z)  exp(i(3/2)z)  = J exp (izt) f (t)dt 

0 

1 

and  which  vanishes  on  the  sequence  A.  Furthermore,  since 

| F ( x)  | <_  exp(-/| x^)  it  can  even  be  shown  that  f is  a C - ] 

function,  which  settles  the  problem  for  all  L^-spaces  (p  ^ 1)  i 

and  C.  This  completes  the  proof. 

The  direct  approach  via  Theorem  2 of  the  necessity  of  the 

condition  in  Theorem  1 has  various  advantages.  For  instance 

it  can  be  used  to  show  directly  that  if  Z (1/X  ) < » , 

y n 

then  the  distance  of  t to  the  linear  span  of  the  powers 

^ Ic  ^ n 

t (k  £ n)  is  asymptotically  >/b  - e)  , provided  that  the 

sequence  { A^}  is  well-spaced  in  the  sense  that  |A  - A | >_ 

|p  - q | • d , where  d > 0.  For  this  result  and  its  consequences 

we  have  to  refer  to  [10]. 

2 Extensions  of  the  Muntz-Szdsz  theorem 

It  is  a natural  question  to  ask.  What  is  the  form  of  the 
MUntz-Szdsz  theorem  if  we  replace  the  interval  [a,b]  (a  0) 
by  a Jordan  arc  in  the  plane?  The  oldest  result  of  this  kind 
is  due  to  J.  L.  Walsh  [17]  who  showed  that  if  y is  a Jordan 
arc,  then  the  powers  zn  (n  = 0,1,2,...)  span  the  Banach  space 
C(y)  of  all  the  continuous  and  complex  functions  on  y. 

Recently  it  was  shown  by  J.  Korevaar  [9]  and  P.  Malliavin  and 
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J.  A.  Siddiqi  [11]  independently  that  if  A = {A^}  is  an 

increasing  sequence  of  positive  real  numbers  and  if  y is  i 

an  analytic  arc,  then  ^(l/A^)  < °°  implies  that  the  exponentials 

exp (A  z),  z e y and  n = 0,1,2,...  fail  to  span  C(y).  1 

n J 

This  result  is  obviously  equivalent  to  the  statement  that 

if  E(l/A  ) < °°,  then  there  exists  a nonzero  measure  y on  y J 

n I 

such  that  the  entire  function  j 

F(z)  = J exp( zt ) dy (t)  , 

Y 

vanishes  on  A . 

In  [9]  as  well  as  in  [11]  it  is  shown  that  this  theorem 

i 

is  a consequence  of  the  fact  that  on  an  analytic  arc  certain 
functions  can  be  constructed  which  belong  to  a quasi-analytic 

class  on  y.  It  would  be  of  interest  to  determine  whether  j 

the  complex  variable  method  discussed  in  section  1 could  be 
made  to  work  in  this  case.  This  would  amount  to  settling 

the  following  question.  If  £(1/An)  < 00  does  there  exist  1 

a simple  construction  of  an  entire  function  G such  that 

F(z)  = G(z)n(l  - z/A  ) can  be  written  in  the  form 

n 

/ exp (zt) dy (t) , 

Y 

where  y is  some  nonzero  measure  on  Y ? A first  step  in 
settling  this  question  would  be  to  examine  whether  the 
Paley-Wiener  theorem  can  be  extended  to  include  analytic  arcs. 

At  this  time  only  the  following  partial  answer  to  this  prob- 
lem is  available. 

Let  F be  an  entire  function  of  exponential  type,  let 
h (0)(O  < 0 < 2ir)  denote  its  Phragmhn-Lindeldf  function,  let 

r 

$>  denote  its  Borel  transform  and  let  S,  denote  the  con- 
i’ 

jugate  indicator  diagram  of  F. 
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The  classical  Paley-Wiener  theorem  states  that  if  F be- 

2 

longs  to  a class  L on  the  real  line,  then  S is  a segment 

2 9 

and  4>  is  of  class  E on  the  domain  determined  by  the 

2 

complement  of  (for  the  definition  of  the  class  E we 

refer  the  reader  to  [5]).  This  formulation  of  the  Paley- 

Wiener  theorem  suggests  that  the  kind  of  generalization  we 

are  looking  for  is  to  determine  necessary  and  sufficient 

2 

conditions  on  F in  order  that  t>  is  of  the  class  E on 
the  domain  determined  by  the  complement  of  the  convex  hull 
of  a given  analytic  arc.  In  this  direction  we  have  the  fol- 
lowing result. 

THEOREM  3.  (V.  E.  Kacnel'son  [7]),  W.  K.  Delaney  [4]  and 

W.  A.  J.  Luxemburg)  . I_f  F is  an  entire  function  of  ex- 

2 

ponential  type  and  if  $ belongs  to  the  class  E on  the 
complement  of  the  convex  hull  F of  a Jordan  arc,  then  there 
exists  a constant  K > 0 which  depends  only  on  T such  that 

oo 

(2.1)  J |F(rexp(i6))  |2exp(-2rh(0))dr  _<  K j | <t>  (w)  | 2 |dw  | 

0 y 

for  all  0 <_  0 < 2tt,  where  h is  the  Minkowski  support  function 
of  T and  y is  the  boundary  of  r . Furthermore , the 
converse  does  not  hold,  that  is,  there  exists  convex  compact 
sets  T such  as  disks,  and  entire  functions  F of  exponential 
type  for  which  the  left-hand  side  of  (2.1)  is  a finite  and 
continuous  function  of  0 but  for  which  the  right-hand  side 
fails  to  be  finite. 

A detailed  discussion  of  this  result  and  its  relation  to 
the  Muntz-Szasz  theorem  will  appear  elsewhere. 
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APPLICATION  TO  COMPATIBLE  FINITE  ELEMENTS 

Lois  Mansfield 


Error  bounds  in  Sobolev  norms  are  given  for  the  inter- 
polation scheme  presented  in  [1]  and  [2].  Applications  of  this 
scheme  to  finite  element  methods  are  given. 


In  [1]  and  [2]  an  interpolation  scheme  which  interpolates 
to  compatible  boundary  data  consisting  of  values  and  normal 
derivatives  of  order  up  to  m-1  given  on  the  edges  of  a triangle 
T is  introduced  and  analyzed.  Suppose  T has  vertices  P^  = 
(j^i^i),  i = 1,2,3.  Let  E^  be  the  edge  opposite  the  vertex 
P^.  The  interpolant  Qmu  of  [1]  and  [2]  may  be  expressed  as 

(1)  Qmu  = -|[P“  + Vm2  + P3  - Lm]u, 


where  1™  = P^P^P™,  i / j,  j ^ k,  k ^ i,  the  product  being  taken 
f J ^ 

in  any  order,  and  the  projectors  P^  are  the  Hermite  interpo- 
lants  to  values  and  the  first  m-1  directional  derivatives  (par- 
allel to  E^)  on  Ej  and  E^  interpolated  along  parallels  to  E^. 
These  directional  derivatives  can  be  uniquely  computed  from 

the  given  values  f.  and  normal  derivatives  g.  . , j = 1,2,3; 

J J » 1 

i — 1,..., m— 1 . 

By  Lemma 

normal  derivatives  g. 


By  Lemma  4.2  of  [2],  Q u interpolates  given  values  f.  and 

if  and  only  if  this  data  satisfies  the 


J >1 

compatibility  conditions 
,k  „£ 

(2)  J 


8s 


8s7 

J 


— j 
8sT  8s,  P 
J i 


0 < k,£  < m-1. 


at  each  vertex  P with  adjacent  sides  E.  and  E.,  where  8/8s. 

u i 3 th  i 

denotes  directional  differentiation  parallel  to  the  i side. 
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2 

Let  T_  , be  the  3m  -dimensional  set  of  polynomials  which  are 
2m- 1 

of  degree  2m-l  along  parallels  to  the  three  sides  of  T.  By 

Lemma  4.1  of  [2],  lmu  is  the  element  of  T„  , which  interpo- 

2m- 1 

lates  u with  respect  to  the  interpolation  conditions  of  (2). 

The  form  of  Qmu  is  a sum  of  directional  derivatives  of  u 
evaluated  on  9T,  each  multiplied  by  a rational  function.  These 
rational  functions  are  often  referred  to  as  "blending  func- 
tions", and  hence  the  interpolation  scheme  just  described  is 
often  called  blended  interpolation. 

In  [2],  it  is  shown  that  Qmq  = q for  all  polynomials  of 
degree  3m-l  or  less.  In  addition,  error  estimates  are  given 
in  the  sup  norm  for  the  approximation  of  u 6 C , 2m  <_  r <_  3m, 
by  its  blended  interpolant  Qmu,  using  the  Sard  kernel  theorem. 
No  estimates,  however,  were  given  concerning  the  approximation 
of  derivatives  of  u by  corresponding  derivatives  of  Qm.  Such 
estimates  are  obtained  in  the  following  theorem. 

THEOREM  1.  Let  Qmu  be  the  blended  interpolant  to  u € Hr(T) , 

2m  <_  r <_  3m,  given  by  (1) . Then 

(3)  I I u - Qu  | | < — - ||u||  , 0<k<_m, 

H (T)  p H (T) 


where  h is  the  length  of  the  longest  side  of  the  triangle  T,  p 
is  the  diameter  of  the  inscribed  circle,  and  K is  a constant 
independent  of  u,  h,  and  p. 


SKETCH  OF  PROOF.  Using  standard  techniques  in  finite  element 
analysis,  we  map  T onto  the  standard  triangle  T^  with  vertices 
(0,0),  (1,0),  and  (0,1)  using  an  affine  transformation.  This 


gives 
(4)  | 


u - Qmu 


C J 


1/2 


1 k 

hk(t) 


u - Q u 


k 

Hk(Ts) 


where  J is  the  Jacobian  of  the  transformation.  Let 

N = („),  form  a unisolvent  set  such  that  A.  € T for  each  i. 
2 Is 
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The  norm 
I |2 


|v|  2 

L 9 ^ 9 


T " lJlr,2,T  + 
s s 


i=l 


is  known  to  be  equivalent  to  the  usual  Sobolev  norm.  Let  q 
be  the  unique  polynomial  of  degree  r-1  which  interpolates  u 
with  respect  to  the  points  A^.  Then 

u-Qu=u-q-Q  (u  - q). 


We  then  show  that  Qm  € L (Hr(T  );  Hm(T  )).  It  is  clear  that 

A s s 

this  is  the  case  for  L™.  For  it  is  necessary  to  expand 

A A 

various  derivatives  of  u - q in  Taylor  series  along  the  edges 
of  ' 

(5) 

H^(T  ) H1 

s 

-1/2, 


of  T . Thus  we  have 
s 


i * ^ ^ m / ^ ^ . i i i i a /v  i i 

u-q-Q  (u  - q)  | | , 1 C | | u - q | | r 

H (T  ) H (T  ) 

s s 


Hr(T), 


which  when  combined  with  (4)  proves  the  theorem. 

In  proving  that  Q™  6 i(Hr(T  );  Hm(T  )),  one  obtains  the 

s s 

following  useful  inequality. 

LEMMA  1.  Let  u € Hr(Tj,  2m  <_  r £ 3m.  Then 

(6) 


< C 


max 


0<j<L 


,2  . . 1/2 


(D  Q u)]  da) 


0 <_  k <_  m,  where  L = max{m>nri-2-  g } . 

Thus  | | Qmu | | can  be  bounded  in  terms  of  derivatives 

Hm(T) 

of  the  given  boundary  data.  We  shall  give  an  application  of 
this  inequality  below. 

One  may  obtain  finite  elements  by  taking  the  boundary 
data  to  be  polynomials.  The  following  result  is  of  interest. 


THEOREM  2.  (Theorem  3 of  [3])  The  interpolant  Qmu  defined 
above  is  a polynomial  of  degree  k or  less  if  and  only  if  the 
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boundary  values  f and  normal  derivatives  ^ , i = 1 m-1; 

j = 1,2,3,  are  polynomials  of  degrees  k and  k-i  respectively 

which  satisfy  the  compatibility  conditions  (2). 

The  compatibility  conditions  (2)  can  be  used  to  show  that 

polynomial  C^-finite  elements  must  be  of  degree  at  least  five. 

Let  t be  a triangulation  of  a polygonal  region  of  the  plane 

such  that  two  triangles  are  either  disjoin;  or  have  a common 

1 

vertex  or  common  edge.  A polynomial  C -finite  element  sub- 
space S(x,k,l)  is  a subspace  of  piecewise  polynomials  which 
are  globally  in  with  values  defined  locally  on  each  T € x. 
In  particular,  S(x,k,l)  satisfies  the  properties;  (i)  a 
change  in  the  value  of  any  q € S(i,k,l)  at  a vertex  P 
affects  the  value  of  q only  on  triangles  having  'P^  as  a common 
vertex,  and  (ii)  on  any  edge  the  values  and  normal  derivatives 
of  q € S(T,k,l)  are  independent. 

LEMMA  2.  Let  S(t,k,l)  satisfy  properties  (i)  and  (ii).  Then 
k > 5. 


PROOF.  Let  T 6 t have  vertices  Pq»Pj_»P2  lat>eled  so  that  the 

angles  at  Pq  and  P^  are  not  right  angles.  Let  q € S(x,k,l) 

have  values  f.  and  normal  derivatives  g.  on  the  edge  of  T 
J J 

opposite  P , j = 0,1,2.  Since  q is  analytic  in  T,  (2)  must  be 
satisfied.  Thus 


(7)  ol8J(P0)  + f^f"  (PQ)  = [3/3s1(o13/3n  + g^/Ss^q]  (PQ) 

= 3/3Sl(3q/3s2)(P0)  = 3/3s2  Oq/Ss^  (PQ) 

= [8/9s2(a2a/9n  + B29/9s2)q ] (PQ) 

a282 (?o)  + B2f2  ^O5* 

By  properties  (i)  and  (ii),  a perturbation  in  the  value  of  f2 
at  P^  must  leave  f^,  g^,  and  g2  invariant,  and  hence  also 
f"  <PQ>*  A s^:i-gktly  simpler  argument  along  these  lines  shows 
that  f2(Po)  and  f2(Pg)  must  a^-so  remain  invariant  for  any 
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perturbation  in  the  value  of  But  this  is  only  pos- 

sible if  f,  can  be  defined  to  nave  prescribed  values  and  pre- 
scribed  first  and  second  derivatives  at  P , and,  by  symmetry, 
at  P . Thus  f^  must  be  of  degree  at  least  five. 

This  argument  can  easily  be  extended  to  S ( ■ ,k,.£l , £ 1 . 

The  result  contained  in  Lemma  2 had  been  obtained  earlier  by 
2-eniser  in  [7J.  Our  proof  is  easy  to  generalize  to  higher 
d i me n s ions.  See  [ 4 J . 

The  interpolation  scheme  defined  by  0‘  can  be  used  to 
match  exactly  non-homogeneous  boundary  conditions  given  on  the 
boundary  of  a polygon  in  trie  finite  element  approximation  of 
solutions  of  elliptic  differential  equations.  This  has  been 
considered  by  Mitchell  and  .Marshall  in  their  papers  [S]  and 
[6]  and  in  the  appendix  of  ['3j.  To  obtain  error  bounds,  one 
is  led  to  consider  the  approximation  of  a function  u C H f'f ) , 

2rn  x ' '3m,  by  a blended  interpolant  O'  'u  which  interpolates 
to  values  f and  possibly  normal  derivatives!  on  one  side,  say 
h , , of  a triangle  'J  , and  to  polynomial  approximations  to  u 
( and  possibly  normal  derivatives!  on  the  other  two  sides.  Let 
us  also  assume  that  the  triangulation  - is  regular  in  the  sense 
that  for  any  'J  C t,  its  smallest  angle  v.  satisfies  0 ' uf  ' u . 


THLOBLM  '3.  Suppose  tliat  on  interior  triangles  finite  elements 
are  used  for  which  one  lias  the  f ol  lowing  error  bounds : 

(fJ‘)  I I u “ "u ' I 1 Khr  r‘‘  | u | | 

H (T)  II  (T) 

(9)  |u  - tt u J , ,,  , ' Khr  ^ ^ u | j , 2m  ^ r '3m, 

’’  Hr_iOT)  0'_k:», 


where  'ui  is  the  interpolant  to  u from  the  space  of  finite 
e 1 e rne n t s with  respe c t_  t o some  interpolation  scheme.  Then 


(10; 


- Qmu  I 


Hm(Tl 


Kh 


r-m  i 


Hr(T) 
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SKETCH  OF  PROOF.  We  add  and  subtract  ttu  and  use  (6).  Partial 
nm  a a a 

derivatives  of  Q (ttu  - u)  on  the  edge  can  be  expressed  in 

terms  of  tangential  derivatives  and  9/3v_.,  the  transform  in 

T of  3/3n.  under  the  affine  transformation  which  maps  T onto 
s J 

T . Note  that  3/3v.  depends  upon  the  geometry  of  T,  but  this 

s J _1 

dependence  can  be  expressed  in  terms  of  (sin  a ) , which  we 

have  assumed  is  uniformly  bounded  away  from  zero  for  all  T € t. 
Thus 


Qm(7TU 


- u) 


Hm(T  ) 
s 


< C max  (/*  (g^t^  i'> 
0<i<m-l  E1  ljl 
0 <£<m- 1 
0<j<L 


-L  9 1 


C' 


r » 2,T 
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CONVERGENCE  RESULTS  FOR  CARDINAL  HERMITE  SPLINES 


M.J.  Marsden  and  S.D.  Riemenschneider 


By  first  investigating  the  behaviour  and  exact  limit 
properties  of  the  Fourier  transforms  of  the  fundamental 
splines  for  cardinal  Hermite  interpolation,  results  concern- 
ing the  convergence  as  the  degree  tends  to  infinity  can  be 
readily  obtained. 

Let  S„  denote  the  class  of  functions  S(x)  such  that 
9 it 

2 m 2^— 

S(x)  e C (R) , and  S(x)  e ir„  , in  each  interval 

Zm-1 

[ v , v+1 ] , v e Z (Z  denotes  the  integers).  Given  an  r-tuple 

of  sequences,  (y°,***,yr  ^) , yS  = {y^:  v e Z},  the  cardinal 

Hermite  interpolation  problem  is  to  find  an  (unique)  element 

S(x)  e S such  that  S^(v)  = yS  for  s = 0,l,***,r-l 

2m,  r v 

and  v e Z.  P.  Lipow  and  I.J.  Schoenberg  [3]  established 

that  if  the  sequences  are  of  power  growth,  i.e.  y^  = 0(|v|Y) 

as  v -►  +°°,  then  there  is  a unique  spline  S(x)  e S„ 

2m, r 

that  solves  the  interpolation  problem  and  is  of  power  growth, 

| S ( x ) | = 0(|x|y)  as  | x | °°.  Moreover,  this  spline  is  given 


by  the  formula 

(1)  L2m,r(y°’*'*’yr"1)(x) 


r-1 


L J,  yv  L2m,r , ! 


(x-v) 


s=o  veZ 


where  the  fundamental  splines  L„  (x)  are  uniquely  deter- 

2m, r , s 

mined  by  L^P^  (0)  = 6 , and  L^1  ^ (v)  =0,  v / 0, 

Zm,r,s  p,s  Zm,r,s 

p = 0,1, •• • ,r-l. 

Recently,  S.L.  Lee  [2]  has  established  the  Fourier  trans- 
form representation  of  the  fundamental  solines  as 


Preceding  page  blank 
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(2)  L_  (x)  = 

2m,r,s  2tt 


s f+°° 


H.  r (u)e1UXdu 
2m,r,s 


where  H„  (u)  is  the  quotient  of  two  Hankel  determinants. 

2m,r,s 

More  precisely,  the  denominator  is  the  determinant  of  a matrix 

whose  (i,j)-entry  is  + 2rk]  l)+(j  1)^ 

i,j  = l,***,r,  while  the  numerator  of  H.  is  the 

2m, r , s 

determinant  of  the  matrix  obtained  from  the  denominator  matrix 

s t 

by  replacing  the  (s+1)  -column  by  the  column  vector 

-2m+r-l . T 
' ,u  ) . 


, -2m  -2m+l 

(u  ,u 


The  first  result  is  a technical  lemma  that  makes  the 

dominant  terms  of  H„  more  visible. 

2m,r ,s 

LEMMA  1.  rf  u/tt  .is  not  an  integer , then 


l 

(i) 


V(i)  V1(i)  V2(i) 


(3)  H_  (u)  = 
Z m y r y s 


n 

1^.1  <8+1 


(x+i . ) 


2m 


s+2<j<r 


(x+i.) 


2m-s-l 


(2tt)S  Y V2(k)  n (x+k.)  2m 
k l<j<r  2 


where  x = (u-2tt£) /2tt  and  H satisfies  (a)  2£+r  ijs  an  odd 

integer  and  (b)  | u-2tt£  | <ir;  (i)  denotes  a distinct  set 


i^»***,ir  satisfying  (c)  2i^+r  is  an  odd  integer,  (d) 


X1  < X2  < 


< l 


(e) 


Vl  = ^ (f)  is+l  " 


< V 


the  denominator  sum  is  over  distinct  k,  < k < • • • < k 
12  r 

satisfying  2k. +r  is  an  odd  integer;  and  V(i)  = V ( i ^ , • • • , i^ ) , 
(i)  = V(i^ , • • • , ig) , V2(i)  = (ig+2, • • • ,ir)  and 

V(k)  = V(k^,***,kr)  are  Vandermonde  determinants. 

With  the  aid  of  the  representation  (3),  the  following 
properties  of  ^ g(u)  can  deduced  by  a "pairing"  argu- 

ment on  dominant  terms. 

LEMMA  2.  (a)  H (u)  = 0 fmin (1 , (rir/u) 2m)  1 as  m -+  °°. 

zm,r,s  1 ' — 
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(b)  Var (u  H, 


(u)]  = 0(1)  as  m -+  <»  where 


2m,r ,s 

p = 0, 1 , 2 , • • • , and  the  constants  depend  only  on  p , r and 


Since 


< 1/2  in  (3),  the  dominant  term  of  the 


denominator  is  given  by  the  set  n = ) where 

rij  = j-(r+l)/2.  In  pairing  this  term  with  the  numerator 
terms,  it  can  be  shown  that  in  the  limit  as  m -»■  +<*>  the  only 
remaining  terms  are  those  for  which  (i)  coincides  with  n as 
sets.  Defining 

(4)  Q (u)  = (2tt)_S  l V(i)V  (i)V  (i)V-2(n)  n (x+i.)S+1 

(i)=n  s+2<j<r  J 

on  the  interval  | u— 2tt £ | <tt  where  | £ | j<_(r— 1 ) / 2 and 

x = (u-2tt£)  / 2 tt  , g(u)  = 0 for  |u|>r7T»  ar>d  g ( 2 tt  r)  ^ -t-n ) 

= [Q  (2tth  +tt-0)+Q  (2TTn.  1-tt+0)]/2,  j=0,l,-*-,r,  we  have 

PROPOSITION  3.  The  relation 

(5)  lim  H (u)  = 0 (u) 

m - co  2m’r’s  r’s 

holds  for  all  u.  Moreover,  the  convergence  is  uniform  out- 
side any  neighbourhood  of  the  points  2nr)^+v,  j = 0,l,***,r. 

Letting  F denote  the  Fourier  transform,  Lemma  2(a), 
Proposition  3 and  (2)  combine  to  yield 


LEMMA  4.  The  relation 

(6) 


(p) 


lim  L;"'  (x)  = (-i)S  F(Q  )(p) (x) 

2 m j r j s r , s 


holds  uniformly  in  x for  each  integer  p ■>  0. 
If 


P P 

£ ®...$^£  denotes  the  Banach  space  of  r-tuples 


, ° r-1  c 

(y  , • • • ,y  ) of 


P nil 

% sequences  with  norm  E 1 1 y | 


md 


1 K 

WP’  (R)  denotes  the  Sobolev  space  of  functions  f for 
which  f^r  ^ is  absolutely  continuous  and  f^  e LP(R), 
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j = with  the  norm  Z||f 

Schoenberg  [3]  have  shown  that  L 


(3) 


| , then  Lipow  and 

LP 

_ is  a linear  operator 

. 2m, r 

from  to  WP,r  (R) . Following  [4],  Lemma  2(b) 

and  Proposition  3 can  be  used  to  obtain  convergence  results 
in  these  spaces. 


THEOREM  5.  The  operator  L^m  r defined  by  (1)  .is  a bounded 

linear  operator  from  to  WP,r  ^(R),  1 < p < +°°, 

whose  norm  is  bounded  independently  of  m. 

THEOREM  6.  If  W^y0,-**, 

(x-v) , then  the  relation 


THEOREM  6.  If  W (y°,''-,yr  1)  (x)  = Er  1 yS(-i)SF(0  ) 

— r J s=o  v=-°°  Jv  r,s 


(7)  lim  II  ^m.r^0’  * * " ,yf  ^ ~ ^r(y0 ’ ,yT  b ||p  r_1  = 0 


m 


holds  for  each  (y  , 


,yr_1)  c A 


t , 1 < p < 


S (s)  s 

If  the  data  {y  } is  given  by  f ' (v)  = y , v e Z, 

s = 0,***,r-l,  where  f is  a "nice"  function,  then  uniform 

convergence  results  can  be  expected  in  light  of  Schoenberg  [5]. 


THEOREM  7.  Let  f(x)  be  defined  by 
r-1  , ,Ns  r rir 
(8)  f(x)  = l 1 


s=o 


~r  tt 


Q (u)e1XUdB  "(u) 
r , s s 


where  3g  is^  the  2TT-periodic  extension  of  a_  bounded  measure 
on  The  relation 

(9)  lim  L2m  r(f,f,,**-,f(r_1))(p)(x)  = f(p)(x) 
m ->  °°  ’ 

holds  uniformly  in  x for  each  integer  p ^ 0. 

The  proof  follows  from  an  identity  which  allows  the 


(j) 


identification  fVJ'(v)  = f.(v),  v e Z where  f . (x)  = 

fTT  ixu  ^ ^ 

^ e d3_.(u).  Then  a summability  argument  (see  [1]  and 


1/2,  I] 

[6])  yields 
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r-1 

= 1 

s=o 


(-1) 


s r 


9 H (u) (iu)Peixude^ 

2 ti  2m,r,s  s 

' —oo  * ’ 


and  the  convergence  properties  of  H„  provides  the 

2m,r,s 

result . 

Using  the  same  sort  of  argument  provides  the  theorem 


THEOREM  8.  The  following  classes  of  functions  are  equivalent 
for  1 < p _<  2 : 

(a)  WP’r  ^(1R)  n { f ( x ) : f is  of  the  form  (8)} 

(b)  { f (x) : f (x)  = Wp’r'1-iim/.2mjr(y°,---,yr"1)(x), 

. o r-1 v p P , 

(y  »***»y  ) e 

(c)  { f (x) : f ( x ) = Wr(y°, • • • ,yr  1)(x),  (y°,*-*,yr  1)  e 

£Pffe-  • J£P  } . 

r 

In  conclusion,  some  explicit  expressions  for  Q (u) 

r , s 

are  noted:  (r=2) 

Q2  0(u)  = (2tt—  |u  I )/2n,  | U | <_2 tt  and  Q2  ^(u)  = sgn  u, 
for  0 < | u | < 2n; 

( r=3)  Q3  q(u)  = (27r)_2M3(u)  - (l/8)M’^(u) 

Q3  l(u)  = -(2tt)"2M^(u)  and  Q3  2(u)  = - (2v)~2H”(u)  / 2 

for  0 <_  | u | _<  3tt  except  u = ±ir,  + 3ir,  where 

2 

f (u+3tt)  /2  — 3tt  <_  u <_  -ti 

M3(u)  = < [ (u+3x)  2-3  (u+Tl)  2 ] /2  -71  _<  U <_  TT 

L ( 3tt— U ) 2 / 2 TT  <_  U _<  3tt  . 

In  general,  Q (u)  will  be  a combination  of  the  derivatives 

L h *”  * ® 

of  the  r B-spline  M^. 
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n-WIDTH  UNDER  RESTRICTED  APPROXIMATIONS 


Avraham  A.  Melkman 

The  purpose  is  to  find  optimal  approximating  subspaces  to 
a set  of  functions,  when  restrictions  are  placed  on  the 
approximation.  Two  instances  are  outlined.  The  first  requires 
the  interpolation  of  the  function  at  a fixed  set  of  points  and 
may  be  connected  to  problems  of  optimal  recovery.  The  second 
concerns  monotone  approximations,  the  main  result  involving 
the  case  when  polynomials  are  optimal. 

In  the  usual  n-width  problem  (in  the  sense  of  Kolmogorov) 

one  is  given  a subset  A of  Banach  space  X and  the  problem 

* 

is  to  identify  an  optimal  n-dimensional  subspace  X^  of  X 

* 

from  which  to  approximate  A,  ef  Lorentz  [2].  Thus 
achieves 

d (A)  ■*  inf  sup  inf  |lx-y||. 

n X x£A  y£X 

n n 

Since  it  is  often  desirable  to  have  the  approximation  y f 
satisfy  some  side  conditions,  it  is  natural  to  consider 
corresponding  n-widths. 

1.  Interpolatory  conditions. 

This  section  is  based  on  joint  work  with  C.  A.  Micchelli. 
Take  X - L2[0,1],  A ■ ■ (f  |f  6 w!;[0,l],  [|f(r)|l2  1 D a”d 

let  Tn  be  a fixed  set  of  points  0 <_  t^  <...<  t^  <_  1.  Given 
f £ W2  and  Xn  we  now  consider  only  those  approximations 
y 6 which  interpolate  f at  T^, i. e.y (t^)“f (t^) , i"l, . . . ,N, 
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and  denote  the  corresponding,  interpolatory , n-width  by 

d (W*;T  ). 
n 2 N 

THEOREM  1.1.  Let  U^CT^)  an^  v^(x)  b£  the  eigenvalues  and 
eigenvectors  of 


v^2r^(x)  = (-l)rp  v (x ) + X a A (x-t  ) 

i=l 

v(J)(0)  = v(j)(l)  = 0,  j = r, . . . , 2r-l ; v(ti)=0,  i=l,...,N 

-r  -4  1 

a free.  Then  d (W„;T  ) = p (T  ) . 

i n 2 N n N 

Furthermore,  let  £ and  n. be  the  (simple) 

l n ^ n-r  — — — 

zeros  of  v^Cx)  and  v^  (x)  respectively,  n >_  N.  Then  op- 
timal n-dimensional  subspaces  are 

i.  natural  splines  of  degree  2r-lwith  knots  at  

ii.  ordinary  splines  of  degree  r-lwlth  knots  at  

— I* 

In  both  cases  d^CV^jT^)  — achieved  by  interpolation  at 


5 1 » • • • » £ • 

l n 

Conversely  these  results  yield  as  a byproduct  tighter  L^- 
bounds  on  the  error  in  natural  spline  interpolation,  by  expli- 
cit estimation  of  For  example  in  cubic  natural  spline 

2 (2) 

interpolation  one  obtains  a bound  of  (A/tt)  ||f^  l^,  halving 
Schultz’s  estimate  [5], 

J(  — f 

It  is  natural  to  ask  which  T__  minimizes  d.. (W_;Txt). 

N N z N 

Clearly  a lower  bound  for  the  latter  is  d^V^). 

THEOREM  1.2.  Let  X^.u^Cx)  be  the  N-th  eigenvalue  and  eigen- 
function of  u(2r)(x)  “ (-l)ru(x),  u^^(0)  = u(j)  (1)  = 0, 

★ 

j=r, . . . , 2r-l,  characterizing  the  unrestricted  N-width.  Then  Tf] 
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Is  the  set  of  zeros  of  u^(x)  and  un(Tn)  “ *N» 

„ * 
COROLLARY  1.1.  The  spline  spaces  of  theorem  1 . 1 with  are 

optimal  subspaces  for  the  unrestricted  N-width  problem,  in 

addition  to  the  classical  eigenfunction  subspace . For  example : 

cubic  natural  spline  interpolation  at  equally  spaced  points  is 

-3 

an  optimal  approximation  procedure  for  This  gives  a con- 

crete example  of  nonuniqueness,  an  issue  discussed  by  Karlovitz 

[1]. 

These  results  may  also  be  applied  to  the  problem  of  L2~ 
optimal  recovery  of  a smooth  function  from  a sampling  of  its 
values  pet  T^,  along  the  lines  of  Micchelli,  Rivlin  and  Wino- 
grad  [4].  Briefly,  the  interpolation  procedures  of  theorem  1.1 
provide  optimal  recovery  schemes;  the  optimal  choice  for  an 
additional  sampling  point  is  at  the  additional  zero  of  vN+^(x); 
and,  the  most  information  is  gained  by  putting  the  sampling 
points  at  the  zeros  of  the  function  which  is  hardest  to  recover. 

2,  Monotone  approximation  of  monotone  functions. 

Take  X = LjO.l],  A = Mr  = (f|f  € W*,  f'  >_  Ol  and  restrict 
the  approximations  to  be  monotone,  denoting  the  corresponding 
n-width  by  d^+(Mr). 

THEOREM  2.1.  d (M  ) = Id  (W  ),d  (M  ) » d (W  ) 

d (Wr)  < d +(Mr)  < c d (Wr)  for  some  c , r > 2. 
n 00  n r n — r* 

The  proof  uses  the  observation  that  the  unrestricted  n- 

T ~T 

widths  of  M and  W are  the  same.  Further,  for  r = 1,2 

00  ' 
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-j- 

the  optimal  approximation  procedure  for  is  one  of  piece- 

wise  linear  interpolation  at  equally  spaced  points  which  for  M 

yields  a monotone  interpolant.  For  r > 2 the  upper  bound  is 

a restatement  of  DeVore's  result  (these  proceedings)  that  the 

polynomial  Jackson's  theorems  remain  valid  even  with  the  mono- 

tinicity  constraint.  However,  this  yields  no  knowledge  of  the 

* 

"worst  approximalie  functions"  or  exact  values  of  d^.  The  int- 

— r 

erpolatory  method,  so  successful  for  W^,  cannot  work  here  for 
it  cannot  ensure  the  monotinicity  of  the  interpolant.  It  seems 
best,  rather  than  good,  approximations  have  to  be  used,  a form- 
idable task.  We  therefore  limit  ourselves  here  to  some  results 
"4"  r 

concerning  d^(M  ) when  the  optimal  subspace  has  to  consist 
of  polynomials  of  degree  r-1 

d^(Mr)  = sup  inf  ||f-p  , IL* 

r f€Mr  p*  >0 

r-1— 

* 

Lorent  and  Zeller  [3]  showed  that  Pr_^  is  a best  monotone 

approximation  to  f if  and  only  if  there  exist  a^, 

i * i 

x^ , i=l, . . . ,m  and  b^.y.^,  i=l,...,&  such  that  | f (xi)_Pr_1 (x^) | * 

||f  - Pj-.JI*  P*-l^yi^  * °»  bi  > 0 » m + £ 1 r + 1 and 

U)  ™ a q(x  ) + Z b q'(y.)  = 0 
i-1  i=l 

for  any  q a polynomial  of  degree  r-1.  This  condition  together 
with  a consideration  of  duality  can  be  used  to  prove  a result 
stating  roughly  that  only  functions  with  many  flat  spots  are 
candidates  for  the  worst  function. 

I 

LEMMA  2.1.  f £ Mr  can  be  a worst  approx imahlg  function  only  if 
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Q = f - Pr_^  is  £ perfect  eguioscillating  spline  of  degree  r 
with  m + 2&  - r - 1 knots,  satisfying  Q' (y^)  = Q"(y^)  = 0, 

i = 1 1,  Q(xi)  = (-1)1  ||Q||,  i = 1, . . . ,m. 

The  value  ||Q||  is  a local  maximum  in  the  sense  that  any 
function  close  to  f can  be  approximated  at  least  as  well. 
There  are  therefore  many  local  maxima,  corresponding  to  the  po- 
ssible values  of  m and  i.  , cf.  Lorentz  and  Zeller  [3],  and 
the  problem  is  to  identify  the  global  one.  Intuitively  one  ex- 
pects the  worst  function  to  be  such  that  its  best  approximation 
is  hampered  most  by  the  monotinidity  constraint.  Indeed 
LEMMA  2.2.  If.  f ijs  such  that  its  best  monotone  approxima- 
tion has  21  <,  r - 2 then  there  exists  a function  which  is 
worse  approximable . 

These  results  lead  to  the  following  characterization. 

+ y 

THEOREM  2.2.  For  r even,  d^_(M  ) = US^H^  where  S is^  a_ 

polynomial  of  degree  r equloscillatlng  3 times  at  0,x  , 1, 

with  S'(y,)  * S'  (y,)  = 0,  i=l, . . . , (r-2)/2,  = 1,  and  such 

r i r i r 

that  0,x^,l  and  y^, i=l, . . . , (r-2)/2  support  a formula  (1). 

The  existence  of  such  an  S is  ensured  by  theorem  16  of 
Lorentz  and  Zeller  [3],  and  it  is  unique  up  to  a reflection. 

For  r odd  a similar  result  holds  with  an  additional  condition 

s;(o)  = 0. 
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THE  EXACT  ASYMPTOTIC  VALUE  FOR  THE  N-WIDTH 
OF  SMOOTH  FUNCTIONS  IN  L°° 


Charles  A.  Micchelli  and  Allan  Pinkus 


In  this  paper  we  answer  a question  raised  by  Chui  and 
Smith  and  obtain  the  exact  asymptotic  value  for  the  N-width 
of  the  set  D = { f : | | L f [ < 1,  f e W^  [0,1]}  where  L is 

an  r-th  order  totally  disconjugate  differential  operator 
and  | | • ||  = sup  norm  on  [0,1]. 

1.  Introduction 

Let  W^[0,1]  = {f:f^r  ^ absolutely  continuous  on  [0,1], 

f^  e L [0,1]},  \ e Cr  Ho,!),  j =1 , . . . , r , and  (Lf)  (x)  = 
r . 

it  (-; 1-  X.(x))f(x).  The  N-width  of  the  set 

dx  j 

(1)  Dr  = { f : f £ v£[0,l],  | | Lf  | < 1} 

(relative  to  C[0,1])  is  defined  by 

(2)  d (D  ) = inf  sup  inf  I I f-g I I , 

*N  feDr  *eXN 

where  the  infimum  is  taken  over  all  N-dimensional  linear 


subspaces  X^  of  C[0,1]. 

The  purpose  of  this  paper  is  to  prove  the  following 
theorem  which  answers  a question  raised  in  Chui  and  Smith  [1]. 
Let 


e 

r 


2 

r+] 

TT 


r (-»k(r+1) 

-«  (2k+l) r+1 
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THEOREM.  For  an^  r > 1, 

lim  Nrd  (D  ) = e 
,,  Nr  r 

N-*» 

The  proof  of  the  above  theorem  relies  on  several  results 
of  [3]  which  we  summarize  below. 

2 . Proof  of  theorem 

Let  K(x,y)  be  the  Green's  function  for  the  initial 
value  problem 

(Lf)  (x)  = h (x) 
f (l)  (0)  = 0,  i-0,1. • • .,r-l. 

Also,  define  ko(x),...,k  ^(x)  as  the  unique  set  of 

functions  in  the  null  space  of  L satisfying  the  conditions 

kij)(0)  = 6ij’  i.^0.1.-  -..r-i. 

The  Green's  function  K(x,v)  has  the  property  that 

!0  , x < y 

H(x,v),  x > y 

where  for  each  fixed  y,  H(x,y)  is  in  the  null  space  of  L, 
as  a function  of  x. 

Thus  D^_  has  the  equivalent  representation 
r-1  /•! 

(3)  D = { [ a k (x)  + K(x,y)h(y)dy : ( |h| 1^  < 1, 

j=0  *0  r 

(a~,...,a  .. ) £ R } • 

o r- 1 

S.  Karlin  proves  in  [2]  that  for  every  integer  s s 0 
there  exists  a function 
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r-1 


(4)  P (x)  = l b k (x)  + l (-1) 


j=0 


1 J 


j=0 


j; 


J 


K(x,y)dy 


o - 50  ‘ «1  <...<  5S  < Cs+1  -1 


which  equioscillates  r+s+1  times,  that  is, 
i+1 


W = ^ 


psl  L»  i = 1,  ••  • , r+s+1. 


for  some  points  0 < t,  < t„  <...<  t , , , <1  (see  also  [3]). 

1 2 r+s+1 

We  will  denote  by  the  class  of  all  functions  which  may 


be  expressed  as 
r-1 

P(x)  = l a k (x)  + 


1=0 


3 3 


f (-D3  rj+i 

j»0  Jr. 


K(x,y)dy 


for  some  constants  (a_,...,a  , ) e R and  points 

0 r-1 

0 = n„<  n,  <...<  n„  <10.1  = 1,  £ ^ s.  Then  P has 
01  £ £+1  s 

the  following  properties: 

(5)  l|PsIL  s l|P|L»  P € Qs, 

and 


(6) 


min  | f (x  ) I 

l£ j<r+s+l  1 


< 


where  f is  any  function  in  such  that  for  some  points 

0 < Xj^  <...<  xr+s+1  < 1,  f(xi)f(xi+1)  £ 0,  i=l,  ...» r+s . 

The  importance  of  the  function  Pg  rests  on  the  equation 

(7)  dN(Dr)  = ||PN_rIL,  N > r, 

which,  along  with  (5)  and  (6),  was  proven  in  [3]. 

We  are  now  prepared  to  prove  the  theorem. 
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PROOF.  For  every  integer  N,  let 
N-l  . 

GN(x)  = I (-1)1  K(x,y)dy. 

.1=0  J * 


We  claim  that  there  exists  a v^  in  the  null  space  of  L 

such  that  for  the  function  Hx,  = G„+v„ 

N N N 


lHNIL  = er 


(8)  lim  Nr 
N-*=° 

N N 

and  there  exist  N point  0 ^ x^  <.  . . < x^  < 1 such  that 


(9)  lim  Max 
N-xx>  l<i<N 


NrHN(x^)  - (-l)i+1 


= 0. 


These  facts,  together  with  (5),  (6),  and  (7)  imply 


mm 


H 


N+l  i 


(x/+1)|<  dN(Dr)  < 


H 


N-r+1  1 00 


l<i<N+l 

Thus  we  conclude  the  validity  of  the  theorem. 

Let  us  then  prove  (8)  and  (9).  Recall  that  the  r-th 


Euler  polynomial  is  defined  by  the  relation 

r 

x e R . 


2x 


E (x+1)  + E (x)  = 
r r r! 


? » 


,(r) 


Here  we  have  normalized  E so  that  E (x)  =1.  We  per- 

r r 

form  the  usual  surgery  on  E and  define  E^  by 


(10) 


E^(x)  = E^(x)  , x e [0,1] 
E^(x+1)  = -E^Cx) , x e R. 


It  is  evident  that  we  may  express  as 


G (x)  = — ( (—  E 

N Nr  JO  dyr  r 


(Ny))  K(x,y) dy . 


Our  next  step  is  to  integrate  the  above  expression  by  parts. 

J* 

For  any  f e W^IO.l],  there  exists  constants  Cq, c^, . . . ,cr_^ 
such  that 
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, ->  r-1  fx 

f m (y)K(x,y)dy  = f(x)  - £ c.k.(x)  + I J(x,y)f(v)dy 

JO  j=0  1 J Jo 

where 

r Sr 

J(x,y)  = (-1)  — -K(x,y),  y < x. 

3y 

Applying  this  identity  to  G^tx)  if  follows  that  there  exists 

in  the  null  space  of  L such  that 

1 - i fx 

G«(x)  = ~T  Er(Nx)  “ VN(X)  + J(x,y)E  (Ny)dy. 

N N JO  r 

We  define  = G + v . From  (10)  we  note  that  (8)  and  (9) 

will  follow  provided  that 

(ID  l|ErIL-er 

* 

and 


(12)  lim  max  | f*  J(x,y)E  (Ny)dy|  = 0. 

N-**>  0<x<l  Jo  r 

00 

The  expression  (11)  for  the  L -norm  of  is  well-known. 

It  is  easily  deduced  from  the  Fourier  series  expansion  of 

1 TT  X_ 

e K (x)  and  the  fact  that 


r 1 1 oo 


Er<T> 


r even 


|Er(0) | , r odd 
Thus  it  remains  to  verify  (12) . 


Let  M be  an  integer.  Divide  [ 0 , 1 J into  M equal  pieces, 
Ii  = 1 = Let  be  the  charac- 

teristic function  of  the  interval  1^  and  define 
M-l 

SM(x,y)  = l J(x,-)gi(y).  Then  lim  ||J~SMI  L = °.  where 
i=0  M-*» 

|*||  denotes  the  L -norm  on  [0,1]  * [0,1].  Now,  for  any 
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X £ [0,1] 


l£  J(*.y)Er 


^ J-S 


(Ny)dy | 


.(Ny)|dy+M| |j 


However 

Thus 


f |E  ( 

Jo  r 1 

’ I Jl1Er^Ny)dyl"  I Er  ^y)  I dy 

' f. 


max 

0<i<Mfl 


I Ji  Er(Ny)dy | 


for  all  i=0 ,1 , . . . ,M-1 


lim  max 
N-**>  O^x^l 


J(x,v) E (Ny) dy | < | | J-S 


^ ' I oo 


j: 


Er(y) |dy. 


Letting  Lb00  we  obtain  (12)  and  thus  the  proof  is  complete. 

REMARK.  We  conjecture  that  the  above  theorem  remains  valid 

for  any  r-th  order  differential  operator 
r-1 

L = Dr  + y a.D11  . 
j=0  d 

Our  proof,  however,  requires  the  upper  and  lower  bounds  given 
by  (5)  and  (6)  which  were  proven  in  [3]  only  for  differential 
operators  L allowing  a global  factorization  into  real  linear 
factors . 
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Let  BV[0,1]  denote  the  set  of  all  functions  of  bounded 
variation  on  [0,1].  For  any  X e BV[0,1],  we  set  ||x||  = 
total  variation  of  X,  and  define 


Mr  = Mr [0, 1]  = { f : f (x) 

X e BV[0,1]  . 


% i 

£ ax 

i=0 


+ 


1 

(r-1) ! 


t)r+-1dX(t\ 


T.n  this  paper,  the  n-width  both  in  the  sense  of  Kolmogorov  and 
Gel'fand,  for 


8r  = { f : f e Mr , ||x||  <.  1} 

is  found.  In  addition,  we  solve  the  related  problem  of  optimal 
recovery  of  a function  f e 8^_. 

1 n-Widths 

Let  d (8  iL^fO,!])  = d (8  ) denote  the  n-width  in  the 
n r n r^ 

sense  of  Kolmogorov  of  8 in  L [0,1],  given  explicitly  by 

d (8  ) = inf  sup  inf  ||f  - g||  , 

n r X feB  geX  1 

n r 6 n 

where  theinfimum  is  taken  over  all  n-dimensional  linear  sub- 
spaces X^  of  L^[0,1]  and  ||f|[^  = / |f(t)|  dt.  The  n-width 
in  the  sense  of  Gel'fand  for  8^  in  C[0,1]  is  given  as 

dn(B  ) = dn(B  ; C [0,1])  = inf  sup  ||f|L, 

r r L feBAL  1 

n r n 

where  the  infimum  is  taken  over  all  subspaces  L^  of  C[0,1] 
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of  codimension  n.  An  extremal  subspace  for  either  d (8  ) 

nr 

or  d (8^_)  is  any  subspace  for  which  the  respective  infimum 
is  attained. 

A perfect  spline  P of  degree  r with  s knots  at 


x1,  . . . , xg,  0 = xQ  < < 

expressible  as 


< x < x ,,=  1 is  any  function 
s s+1  J 


r-1 


P(x)  = X a.x1  + c X (-1)^  J ^+1(x  - t) 


.r-1 


dt . 


i=0  j=0 

The  following  result  is  essentially  contained  in  Karlin  [1], 


x . 
J 


THEOREM  1 . For  n >_  r , there  exists  a perfect  spline  r = Q > 


unique  up  to  multiplication  by  -1,  of  degree  r with  n 

( r ) 1 1 

> ||  =1  which  satisfies  the  boundary 

J i) 


knots  such  that 
,( i) 


conditions  Q (0)  = Q (1)  =0,  i = 0,  1,  r-1,  and 

equiosclllates  at  n - r + 1 points  in  (0,1),  i.e., 

QCr^)  = (-l)1a||Q||oo,  i = 1,  2,  ...,  n - r + 1,  0 < < ...  < 

n < 1,  a = +1  or  -1,  fixed. 

n-r+1  — 

Utilizing  Theorem  1,  we  obtain  the  following  result 
THEOREM  2.  For  r ^ 2, 


(1)  dn(Br)  = d 


"(Br>  - / 


n < r 


Ik 


r-1 


. , n >.  r 

n,r"°° 

(2)  Q has  exactly  n - r simple  zeros  {c.}.  . in  (0,1), 

n,r  — *- l x=l  — 

and  the  linear  space  spanned  by  the  functions  {1,  x,  ...,  x 
1T"”1  1T““1 

(x  - Cj^)+  » •••»  (x  - ^n_r)+  ) is  an  extremal  subspace  for 

w- 

(3)  L*  = { g : g e C[0,1],  g(5j  =0,  i = 1,  2,  ...,  n},  where 

n I 


Qn  r is  an  extremal  subspace 


are  the  n knots  of 

for  dn(8  ) . 

r 

The  proof  of  Theorem  2 is  based  upon  duality  and  analysis 
similar  to  that  found  in  [2], 
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2 Optimal  Recovery 

Let  x = (x, , . . . , x),  x.  = 0 < x1  < ...  <x  < 1 = x , , 
— 1 n 0 1 n n+1 

be  given  and  denote  the  vector  (f(x^),  ...,  f(x^))  by  f^. 

The  problem  of  optimal  recovery  is  one  of  determining  a rule 

for  best  recovering  f e 8 , n ^ r 2,  based  on  the  informa- 

00  ^ 

tion  _f  (for  the  L analogue  of  this  problem  see  [3]). 

Any  transformation  R from  {jf : f e 8^}  into  L^[0,1] 

determines  a recovery  scheme  Sf  = Rf  for  f e B . The  error 

r 

for  recovery  based  on  S is  defined  as 


and 


II I “ slli  = SUP  llf  “ sflli> 

1 feB  1 

r 


E(x)  = inf  (| | X - S||^:S  a recovery  scheme} 


is  the  minimum  error  in  recovering  f e 8^  from  the  informa- 
tion _f.  S*  is  called  an  optimal  recovery  scheme  provided 
that  ||  I - S*||  = E (x)  . 

THEOREM  3.  For  every  n >_  r >_  2,  and  each  x as  above,  there 
exists  a function 


n , n]  x 
P (x)  = Z — f 3 ^ 

- j=0  k ’ x. 

J 


1 — 1 
(x  - t)^  dt  + Z c.(x  - x.)^ 

j=l  J J 
(i) 


which  satisfies  the  boundary  conditions  P (1)  =0,  i = 0, 


r - 1,  and  equioscillates  n - r + 1 times  on  (0,1). 


n - 


r simple  zeros  {^^(x)}^^,  0 < £^(x)  < 


n - r . 


1, 

P has 
x 

Cn_r(x)  < 1,  and  xi  < ^(x)  < x^,  i = 1, 

Based  on  Theorem  3,  we  define  a recovery  scheme  S*  by 

interpolating  the  function  f at  the  values  {x^}^_^  by 

IT”*  1 %»  _ 1 

linear  combinations  of  1,  x,  ...,  x , (x  - S^(x.))+  » • ••» 
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(x  - Cn_r(x))^  1»  i.e.,  (S*f)(x1>  = fCx^,  i = 1,  n. 

Since  < £^(x)  < x^+j_,  i = 1,  n - r,  such  a recovery 

scheme  is  well-defined. 

We  prove  the  following  theorem  by  using  Theorems  1-3, 
and  analysis  paralleling  that  used  in  the  proof  of  Theorem  2. 

THEOREM  4.  S*  is  an  optimal  recovery  scheme  for  B , and 

E(x)  = HpJIoo-  Furthermore,  if  |_  = (£  , ...,  £ ) are  the 

knots  of  Q , then  min  E(x)  = E(£)  = ||Q  ||  = min||P  ||  . 

n,  r x — — llxn,rn“  x 11  x1'® 

Full  details  and  extensions  of  all  the  above  results  will 
appear  elsewhere. 
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ON  NIKOLSKII-TYPE  INEQUALITIES  FOR 
ORTHOGONAL  EXPANSIONS 
R.  J.  Nessel  and  G.  Wilmes 

Nikolskii-type  inequalities,  i.e.  inequalities  between 
different  norms  of  a function,  are  established  for  polynomials 

c <t>  where  { } is  a regular  orthonormal  system  in 

K.  U K.  K K 

weighted  Lebesgue  spaces  LP,  1 < p < As  applications, 

expansions  into  Legendre  polynomials  and  Hermite  functions 
are  considered. 


Nikolskii-type  inequalities,  following  the  usual  nomen- 
clature, are  inequalities  between  different  norms  of  the  same 
function.  Thus  let  us  consider  trigonometric  polynomials 


t (x) : = £.  c e 
n k=-n  k 


ikx 


in  L 


2tt 


with 


> , 2tt  ' ^ 


TT 


[ (1/2tt) / | f ( u ) |Pdu] 
-TT 

ess.  sup  I f (u) | 


1/p 


1 p < 

P = 00 . 


respectively  (we  only  consider  these  values  of  p though 
there  are  of  course  counterparts  for  any  p > 0) . Then  S.  M. 
Nikolskii  (1951)  proved  that 


(1) 


q,  2tt 


< C 

- p.q 


nl/p  - l/qj 


p,2?r 


(1  < p < q < °°) 


with  G independent  of  n.  Let  us  mention  that  inequali- 

p,q 

ties  of  this  type  were  already  considered  by  D.  Jackson  (1933) 
in  the  particular  case  q = the  result  (1)  itself  was  ob- 
tained independently  also  by  G.  SzegO-A.  Zygmund  (1953). 

The  example  of  the  Fej£r  kernel  shows  (cf.  [7,  p.230]) 
that  the  order  in  (1)  is  the  correct  one.  The  best  result 
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known  so  far  with  respect  to  the  constant  reads 


i'JI,.*,  - <2pon+1)1/P  ' 1/qH':Jlp,2,  a i p 5 <1  i -). 


Pq  denoting  the  smallest  integer  not  less  than  p/2.  For 
this  one  may  consult  papers  of  I.  I.  Ibragimov  (1959)  as  well 
as  A.  F.  Timan  [7,  p.  2 2 7 f f ] . The  exact  value  of  the  constant 
in  (1),  however,  seems  to  be  an  open  problem.  Corresponding 
results  are  available  for  entire  functions  of  exponential 
type  as  well  as  for  functions  of  several  variables.  But  for 
detailed  bibliographical  comments  as  well  as  for  certain 
extensions  we  refer  to  [2; 4]. 

Nikolskii  inequalities  have  received  considerable  atten- 
tion because  of  their  wide  applicability  in  various  problems 
of  analysis.  Thus,  for  example,  Jackson  was  interested  in 

o°  # 2 

deriving  L -estimates  from  known  L -results  in  connection 
with  certain  orthogonal  systems;  Nikolskii  used  (1)  to  obtain 
embedding  theorems  for  his  spaces.  So  it  seems  to  be  of  some 
interest  to  treat  the  matter  in  the  frame  of  arbitrary  orthog- 
onal expansions.  To  this  end,  let  us  consider  the  following 
concrete  situation. 

Let  LP(a,b),  1 P °°,  <_  a < b be  the  space  of 

functions,  pth  power  integrable  with  respect  to  the  weight 

w ( x)  ^ 0: 

(J  |f(u)  |Pw(u)du)^P,  1 £ p < °° 
a 

ess.  sup  |f(u)  | , p = a*  . 

00  n 

Let  {$,  }.  n be  an  orthonormal  system  such  that  {4>,  } cr  Lp 
k k=0  3 k w 

for  all  1 <_  p <_  «>,  and  suppose  that  there  exist  a constant 
M > 0 and  some  real  6 e R such  that 


(2) 


< 

w — 


M, 


°°,  w 


< Mk 


k > 0. 
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Then  to  f e Lp(a,b)  one  may  associate  its  Fourier  series 
w 

b 

f ^ E~=0  f'<k>V  f'(k):  = I f(u)*k(u)w(u)du. 

a 

Let  us  consider  the  Ceshro  means  of  order  j e (P  (the  set  of 
nonnegative  integers) 

(C,j)  f:  = z"  (Aj  ,/A^ ) f " (k)  <p  , Aj:  = (n+i). 
n k=0  n-k  n k n n 


Then,  following  Stein  [6],  the  system  is  said  to  be 

regular  if  for  some  j e IP  there  exists  a constant  A > 0 

such  that  for  all  1 < p < <*>,  f e Lp(a,b),  n e P 

w 


(3) 


(C’J)nfl 


Al|f| 


,w 


Denoting  by  II  the  set  of  all  polynomials  r :=  E,n  _ c,  4>, 
n n k=0  k k 

of  degree  n,  we  have 

THEOREM.  Let  the  system  {$^}  be  regular  (for  some  j c P)  , 
and  suppose  that  (2)  holds  true  for  some  6 e IR.  Then 


(4)  || r ||  < 1 CP 

11  n11  q , w — 


/n(26+1),  6 > -1/2^  1/p‘1/q 
log  n , 6 = -1/2 
1 , 6 < -1/2 Jj 


n"p,w 


for  any  1 < p < q < eo  with  C independent  of  n,  p , q . 

Proof . Let  us  consider,  for  example,  the  case  6 > -1/2. 

We  essentially  follow  the  device  of  Stein  [6]  in  how  to  use 
the  Riesz-Thorin  theorem  for  operators  given  on  certain  linear 
subspaces  such  as  the  identity  operator  on  the  set  of  polyno- 
mials. Indeed,  it  is  an  immediate  consequence  of  (2)  that 
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(5) 


< 

'oo  w — 


"nnl,w  k=0M  k 


< CM2„24+1l|r  ||, 

, W — 11  n"  l,w 


for  some  absolute  constant  C' . Let  X be  a smooth  function 
on  such  that  A(t)  = 1 for  0 <_  t <_  1 and  =0  for 

t > 2.  Then  the  means 


CO 

Lnf  :=  Zk=0  *(k/n)f~(k)*k 


possess  the  following  properties: 


(6)  (i) 

(ii) 

(iii) 


(iv) 


L f e JI_  for  all  f 
n 2n 


L r = r for  all  r 
n n n n 


uniformly  for  all  1 _< 


L f||  < C"M2n2i+1 

n , w — 


c LP(a,b) , 
w 

e n , 

n 

<J+1)(o|dol|f||piU 

p < °°,  f e Lp(a,b),  and  neiP, 
w 

Ilf  II,  for  all  f e L^(a,b). 

11  "l,w  w 


Indeed,  (i),  (ii)  are  obvious;  (iii)  follows  by  a multiplier 
criterion  as  given  in  [1],  so  that  (iv)  is  an  immediate  con- 
sequence of  (i),  (5),  and  (iii).  Thus,  the  bounded  linear 
operator  L^,  well-defined  on  all  LP(a,b)-spaces,  1 < p <_  », 
is  of  type  (1,1),  (1,«>),  (»,«),  so  that  the  classical  Riesz- 
Thorin  convexity  theorem  implies  that  is  of  type  (p»q) 

for  all  1 1L  P 1 £ 00 • In  view  of  the  projection  property 
(6,ii)  this  completes  the  proof  of  (4). 

Obviously  (4)  reproduces  the  original  result  (1)  of 
Nikolskii,  apart  from  the  constant.  Concerning  arbitrary 
orthonormal  expansions,  if  (2)  is  satisfied  for  some  6 > 0, 
then  the  order  in  (4)  is  increased  in  comparison  to  the 
trigonometric  case.  But  this  cannot  be  removed  in  general. 
Indeed,  let  a = -1,  b = +1,  w(x)  = 1,  and  consider  the 
(normalized)  Legendre  polynomials 
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$*(x):=  (-l)k[2kk!]  Vk  + l/2(d/dx)k[(l  - x2)k]. 

Then  !|'1>^||cjo  = $£(1)  = /k  + 1/2  so  that  (2)  holds  true  for 
6 = 1/2.  Since  (3)  is  satisfied  if  e.g.  j = 1,  inequality 

(A)  implies  for  any  polynomial  r*:  = ^=o  ck^k  t^iat 

|M|  < c(1-p/q)t,2<1/p-1/q)||r*||  (1  < p < q < ®) . 

u ni i q — ii  ni i p — — — 

Here  the  order  cannot  be  improved.  Indeed,  if  P (x)  is  an 

n 

arbitrary  algebraic  polynomial  of  degree  n on  the  interval 
[-1,1],  then  it  is  a well-known  consequence  of  Markoff's 
inequality  that 

llpnllq  1 ((P  + l)n2)1/P"1/q|lpnllp  (1  < p < q < •), 

and  this  is  best  possible  with  respect  to  the  order  (cf.  [7, 

p. 236]) . 

On  the  other  hand,  M.  F.  Timan  [8]  announced  examples  of 
unbounded  expansions  (e.g.  the  Haar  system)  which  nevertheless 
satisfy  (4)  with  order  n^P 

But  there  are  also  examples  of  expansions  satisfying  (2) 
for  some  negative  6.  In  fact,  let  a = -«,  b = +«>,  w(x)  = 1, 
and  consider  the  Hermite  functions 

$**(x)  :=  (-l)k(2kk!/^r1/2eX  /2(d/dx)ke"X  . 

Then  (2)  is  satisfied  for  6 = -1/12  (cf.  [3,  p.  571]).  Since 
(3)  holds  true  if  e.g.  j = 1,  the  Theorem  yields  for  any 
polynomial  r** 
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Let  us  finally  note  that  one  may  formulate  analogs  of 
the  Theorem  in  the  setting  of  abstract  Banach  spaces.  In 
fact,  the  multiplier  criterion  used  in  the  course  of  the 
proof  works  in  connection  with  arbitrary  spectral  measures  in 
Banach  spaces  satisfying  suitable  summability  hypotheses.  For 
this  as  well  as  for  further  extensions  and  details,  however, 
see  [5]. 
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HYBRID  FINITE  ELEMENT  METHODS 


J.  T.  Oden 

Some  new  techniques  for  determining  error  estimates  for 
so-called  hybrid  finite-element  methods  for  the  approximation 
of  solutions  of  linear  elliptic  boundary  value  problems  of 
second-order  are  described. 

1.  Introduction 

The  hybrid  finite  element  methods,  developed  largely  by 
Pian  and  Tong  and  their  associates  (e.g.  [1])  have  been  used 
successfully  in  the  numerical  solution  of  a variety  of  prac- 
tical problems,  and  there  appear  to  be  certain  classes  of  pro- 
blems in  which  these  methods  have  advantages  over  conventional 
finite  element  techniques  (e.g.  shell  problems,  elasto-plas- 
ticity  problems,  problems  of  cracks  and  stress  singularities). 
However,  the  intrinsic  mathematical  properties  of  these  me- 
thods have  only  recently  begun  to  be  investigated  [2-4].  We 
describe  here  a fairly  general  theory  of  mixed-hybrid  methods 
developed  in  [4]. 

2.  A Mixed-Hybrid  Variational  Principle 

Consider  as  a model  problem, 

(2.1)  - Au  + u = f in  ft 

u = 0 on  3ft 

2 

where  A is  the  Laplacian  operator  in  ]R  , f £ L (ft),  and  ft  is 

2 1 
a convex  polygonal  domain  in  IR  . 

Let  P denote  a triangulation  of  ft  into  E triangles  ftg, 

i.e.  , 
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(2.2) 


ft  = U ^ » ft  n ftf  = 0 if  e i f . 

e=l 


We  also  define  a collection  of  boundary  pieces  of  P 


(2.3)  r = \J  r 


e=l 


r=9fi-s,  r = r n rc 

e e e ef  e f 


where  S is  the  set  of  vertices  of  ft  . 
e e 

We  now  introduce  the  following  definitions  and  conven- 
tions : 

I.  Spaces  Defined  on  the  Partition  P 


(2.4) 

Hm(P)  = {u:  ug  £ Hm(fte)  , 

1 < e <_  E} 

E 

■MM| 

(2.5) 

1 M 1 m =0  1 Uel  1 

H (P)  - e 

e=l 

2 V 

h (ft  ) 

e 

(2.6) 

W(T)  = completion  of  L9(r) 

in  11*11 

w(r) 

where 

(2.7) 

E 

luii  = i i u 

W(D  L-  e 

e=l 

ll2^  i"2 

h 2(9ft  ) 

e 

in  which 

||*||  j,  is  a dual  norm 

H“'2(9ft  ) 
e 

°f  11*11  j, 

h 2( an  ) 

e 

II. 

Product  Space 

(2.8) 


X = H (P)  x L2(P)  x ft/(r) 


(2.9)  INI  - U |u|  \ \ 

' X H (P) 


+ o 


l2(p) 


+ 11*11 

W(r) 


where  L2(P)  = L2(P)  x L2(P)  and  a = (o1,o2). 
III.  Special  Bilinear  and  Linear  Forms 
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(2.10)  BrXxX^m,  B(A, A)  ^ be(~e>~e) 

e=l 


A, A = arbitrary  triples  in  X;  A = (u,a,ip) 


b (A  ,A  ) = / [a  • Vu  + u u + (Vu  - a ) • a ]dx, 
e ~el  e I ~e  e e e e ~e  ~e  1 


dx„ 


(2.11) 


f 

*3  ft 


+ d)  (<1*  u + ^ u )ds 
e e e e 


£ 

e=l 


(2.12)  F(A)  = ) f (A  ) ; f (A  ) 
~ 7 e ~e  e ~e 


L 


f u dx,dx„ 
e e 12 


IV.  Properties  of  the  Forms  B(*,*)  and  F(») 

In  [7],  the  forms  B(A,A)  and  F(A)  are  shown  to  have  cer- 
tain properties  which  guarantee  that  the  variational  problem 
of  finding  A £.  X such  that 
(2.13)  B(A,I)  = F(I)  V I £ X 

has  a unique  solution  in  X.  Moreover , it  can  be  shown  that 
the  mixed-hybrid  variational  problem  (2.15)  is  equivalent  to 
the  weak  form  of  (2.1). 

3.  Finite  Element  Approximations 

The  variational  principle  described  in  the  previous  sec- 
tion has  been  designed  so  as  to  be  naturally  adaptable  to 
mixed-hybrid  finite-element  approximations.  As  expected,  the 
triangles  ft  are  now  viewed  as  finite  elements. 

We  introduce  the  spaces  of  polynomials:  Q^(p)  = (U£ll  (P) , 

ue  £ W*  1 - e ± E}’  9°(P)  " {?  ^ h(P)’  ?e  ^ Pr(V’ 

1 < e < E},  and  Q'V)  * {Y  € W(r),  * f - *|f  £ pt(ref)* 

1 ef 
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1 £ e,  f £ E,  e>f}  where  P (n  ) is  the  space  of  polynomials 

of  degree  < k on  !J  , etc.  These  spaces  have  the  following  in- 

® 2 

terpolation  properties:  given  u £ H (fi  ) , a £ (Hq(ft  ))  , 
m_  3 1 2 e e 

ip  £ H (fi  , 3ft  ) (=  completion  of  the  space  of  normal  deri- 

e e 

vatives  3v/3n  £ L„(3ft  ) in  the  norm  I U|  I = 

2 6 Hm-3/2on  ) 

9 v ^ 

inf{|  | v | | ; m > 1,  — =<!'})  there  exists  constants  C , 

Hm(fi  ) 9n  3ft  1 

e e 

C 2 , C3  > 0,  independent  of  the  partition  and  elements  U,  £, 
and  ip  in  the  respective  spaces  in  (3.1),  such  that 

I I u ~ U | | £ C h"|  | u | | n 

H (ft  ) 6 II  (0  ) 

e e 


I?  - =11  < C h^l |o| I 

L,(n  ) ' Hq(n  ) 

~ e - e 


I*  “ *11  i c3heI^ILm_3/2 

H 2(3ft  ) J e H J/^(3 ft  ) 
e e 


where  h^  = dia(ft  ) and  (with  0 £ s <_  1,  m >_  1), 

p = min(k+l-s,  t-s) ; v = min(r+l,q);  0 = min(t + -j  , m-1). 

(3.2) 

Let  A = (U.E.T)  £ Q*(P)  x Q°(P)  x Q^2(r)  = xhc  X. 

Then  the  mixed-hybrid  finite  element  method  consists  of  seek- 
ing the  A £ X^  such  that 

(3.3)  B(A,A)  = F(A)  V I (X  , 

~ ~ - h 


where  B(*,*)  and  F(*)  are  the  forms  defined  previously. 

Let  n1  and  JI^  denote  orthogonal  projections  of  H*(ft  ) and 

L (ft  ) onto  Q.  (ft  ) and  Q (ft  ) , respectively.  We  introduce  the 
e e k e re  1 

special  stability  parameters 
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u-  inf  ||n^J|2/||»||2 


e e 


KQt2 


— 2>  > 


/I  Ivv 


(3.4) 


v = inf  I | n VV  | | 

V h(ae>  h(V 

G K. 

v - sup  ||7V  -n°7v  II  / 1 1 w II 

v yv  yv 

e k 

y y 

B(P)  = min  {p  " • ve  “ }’  °lPp»vP’Yel1 

1 < e < E eee 


where  z £ H (ft  ) is  the  solution  of  an  auxiliary  problem, 

2 6 6 3z 

-Vz  + z = 0 in  !1  and  = y on  3Q  , y ^ Q 2(3ft  ) , and 
e e 2 e 3n  e ^t  e 

llzelli,n  = 1 1 '•'I  1-15,30  • We  then  have 
e e 

Theorem  3.1.  If  3(P)  > 0,  there  exists  a unique  finite 
element  solution  A°  £ of_  (3.4)  and  the  following  estimate 
holds : 

£ > 2 


(3.5) 

where 


< Ch“| |u°| 


1 £ 

H (P) 


. o ,.o  o o 3u  wo . 
e=(u  - U , Vu  - E , ~ y ) 


c = (1  + 2/l5/B(P))  max{C^ ,C2*  } , a = min{k,r+l , t + j , -1 } . 


We  remark  here  that  one  easy  way  to  guarantee  that  6(P)  > 0 is 

to  choose  the  polynomial  spaces  such  that  k-1  r and  k t+1 

(t  even),  k >_  t+2  (t  odd)  over  a triangular  element 
A necessary  condition  can  also  be  given. 

Theorem  3.2.  In  order  that  (3.3)  has  a unique  solution, 
it  is  necessary  that  p > 0,  1 < e < E.fl 

G < 

In  the  special  case  in  which  Q° (P)C  V(Q^(P) ) , then  y^  = 0, 

v * 1,  and  B(P)  * min  p = p.  Then  p > 0 is  a necessary  and 

e e 

sufficient  condition  for  the  existence  of  a unique  solution. 
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A necessary  and  sufficient  condition  that  the  parameters 
be  positive  is  furnished  by  the  rank  condition. 

THEOREM  3.3.  The  pa rameter  u_  jji  (3. A)  is  > 0 if  and 

G 4 

only  if  the  following  condition  holds:  for  any  y £ Q~^(r), 


(3.6) 


yeue  ds  = o vueC  Q^(fig)  =*  y£  = o 


Additional  properties  of  such  approximations  are  given  in 


[A]  • 
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ON  SIMPLE  INTERGRABILITY  AND  BOUNDED  COARSE  VARIATION 

Charles  F.  Osgood  and  Oved  Shlsha 

1.  Let  f be  a complex  function  on  Then  [1,2] 

f is  simply  integrable  iff  there  exists  a number  I with  the 

following  property:  For  every  e > 0 there  are  positive 

numbers  B and  A such  that  if  0 = x_  < x,  . . . < x , 

0 1 n 

\ > Bi  and  Vi  i 5k  - V *k  - Vi  < 4 £or 

k = 1,  2,  . . . , n,  then 
n 

1 1 - £ f (5k)  (xk  - Xj^)  I < e. 

k=l 

If  f is  simply  integrable,  then  I is  unique  and 

oo 

equals  / f(x)dx,  the  improper  Riemann  integral  of  f on 
0 

[0,°°).  (By  which  we  mean 
R 

lim  J f(x)dx  (finite) 

R-x»  0 

R 

where,  for  each  R > 0,  ^ f(x)dx  is  the  proper  Riemann  in- 
tegral of  f on  [0,R].)  If,  on  the  other  hand,  f is  im- 
properly Riemann  integrable  on  [O,00),  then  it  is  simply  in- 
tegrable iff  it  is  of  bounded  coarse  variation.  This  means 
that  for  every  e > 0, 

n 

(1)  sup  E IfCx^)  ” < 00 

k=l 

n 

The  sup  is  taken  over  all  sequences  (xv)  , n = 1,  2, 

* k=0 

with  0 - Xg  < Xj^  < . . . < x^;  l > e,  k = 1,  2, . . .,n. 
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The  purpose  of  the  present  note  is  to  further  study  the 
concepts  of  simple  integrability  and  bounded  coarse  variation. 

2.  THEOREM  1.  Let  f be  a complex  function  on  [0,°°)  . The 
following  statements  are  equivalent: 

(i)  For  k = 1,2,...,  0 = sup  | f (y)  - f (x) | < and 

k-l<x<y<k 

^k~l  ^k  < °°"  ^ere»  "0"  stands  for  oscillation.) 

(ii)  f is  of  bounded  coarse  variation. 

(iii)  For  some  e > 0,  (1)  holds . 

Proof.  Assume  (i).  Let  e > 0,  0 < x„  < x,  < ...  < x , 

—01  n 

n ^ 1;  min  (x,  - x ) e.  Consider  the  union  of 

l<k<n  k k 1 

{xn,x,,...,x  } and  the  set  of  integers  N with  x„<  N < x , 
U 1 n U — — n 

and  arrange  it  as  a strictly  increasing  sequence  ^^^=0' 

Then 


< (2  + c-1)  r".,  ok. 

Assume  now  (iii).  Then  f is  clearly  bounded  in  [0,<=°). 
For  n,  r = 1,  2,  ...  set 


0 = sup  | f (y)  - f (x)  | . 

’ (n-l)r<x<y<nr 

If  r > 1,  then  0 ..  < 0 for  n = 0,1,2,...,  and 

— nr+j  — n+l,r 

j = l,2,...r.  Hence  if  for  some  r 1 we  show 

l , 0 < °°t  then  it  will  clearly  follow  that  £.  ,0.  < “. 

n=l  n, r J k=l  k 

Let  r be  an  integer  2e.  We  shall  prove 

00  00 

Z ,0  < °°.  Suppose  Z ,0  =».  For  n = 1,2,...  choose 

n=l  n,r  n=l  n,r 

x ,y  in  [(n-l)r,  nr]  such  that  |f(x  ) - f(y  ) I > 0 /2, 

n 7n  J 1 n wn  1 — n,r 

so  that 
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|f(xn)  - f ( (n  + i )r)  I + I f (yn)  - f((n  + ~ )r)  | >_0^  Jl. 

Then 

Zn=ilf  (xn)  “ f + 7 >r)  1 + lf  (yn)  _ f((n  + 7 )r)  | = ®. 

Therefore  at  least  one  of  the  following  diverges: 

£n-ll£(x2n)  ' t<<2n  + 

■ £<(2n  + i'r)|> 

Cilf<W  - £(<2n+  t >r>!- 

■ £<(2n  + \ )r,l> 

and  hence  we  have  at  least  one  of  the  following: 

Zn=llf(x2n)  ' f ((2n  " 7 >r) I + lf((2n  + 7 )r)  - f(x2n> | = », 
Zn=1lf(y2n)  ' f((2n_  7)r)l+  I f ( (2n  + -j  ) r)  - f (y2n>  |=  «, 

Zn=llf(x2n+l)_f((2n  ' 7 )r)l 

+ | f ( ( 2n  + | )r)  - f(x2n+1)|  = », 

Cilf(y2n+i) " f((2n " 7 )r)| 

+ | f ( (2n  + j )r)  - f(y2n+1) | = »• 

However,  each  of  these  relations  contradicts  (iii). 
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3.  For  our  second  theorem  we  need  the  following 

DEFINITION.  A sequence  ^xn^n=o  is  called  an  allowable 
partition  iff 

(2a)  0 = Xq  < < x2  °°, 

(2b)  0 < inf  (x,  - x,  ) <_  sup(x^  - x,  . ) < “. 
k>  1 k>l  k 

NOTATION.  Given  a complex  function  f on  [0,”),  and  given 

u,v,  0 <_  u < v,  we  set  0(f,u,v)  = sup  | f (y)  - f(x)  |; 

u<x<y<v 

observe  that  if  f is  real  on  [u,v],  then 

0(f,u,v)  = sup  f (x)  - inf  f (x) . 
u<x<v  u<x<v 

THEOREM  2.  Let  (xn)n_Q  be  an  allowable  partition,  and 

let  f be  a complex  function,  Riemann  integrable  on  each 

[ 0 , R ] , 0 < R < «.  A necessary  and  sufficient  condition  for 

f to  be  simply  integrable  is  that  ^f  (5n)  (x^  ~ Xn-1^ 

converges  whenever  x ,<E  < x . n = 1.2 

n-1  — n — n 

To  prove  Theorem  2,  we  shall  need  the  following 

LEMMA . Let  f be  a complex  function  on  [ 0 , <=°) , bounded  on 

each  [0,R],  0 < R < °°.  Suppose  both  (x  ) „ and  (y  )°°  „ 

— n n=0  wn  n=0 

are  allowable  partitions,  and  one  of  the  series 

Ek=l^(f ,xk-l*xk) ’ ^k=l^^f ’ xk-l* Xk^  ^xk  " xk-l^  converges.  Then 

?-°  jo-lo.tj!  £k-l<,<f'yk-l>yk)(>'k  - yk-l>- 

4.  Proof  of  the  Lemma.  Clearly  l”_^0(f ,x^_^,x^) 

converges,  and  it  suffices  to  prove  the  convergence  of 

00 

Zk=l^f  ,yk-l’yk^  ' Arran8e  the  set  {xQ,  yQ,  x.^  y.^  . . . } as  a 
strictly  increasing  sequence  (z^)”^*  Let  x^  = z , 

y.  = z » k = 0,1,2,...,  and  let  M be  a positive  integer 
k.  n. 
k 
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such  that  each  [x.  ,,x,  ] contains  at  most  M z 's.  Then 
k-1  k n 

for  r = 1,2, , 

= ‘"h=lZ'j=mh_1+l^(f,Zj-l,Zj) 

1 (M  - l)Z^=10(f  ,xh_1,xh)  <_  (M  - 1)  z“_jO(f  »xh_1»xh) . 

oo 

Hence  Z^_^0(f,Zj  ^,z^)  converges. 

Let  k ^ 1,  y x < y <_  y . Then,  as  is  easily  seen, 


|f(y)  - f(x)  | 1 ^^^(f^j-rZj). 

nk 

and  hence,  0(f,y  ,y,)  < Z 0(f,z  ,z.).  Therefore 

k-l  k.  — n,  .+1  j-i  J 

for  N = 1,2,..., 

Zk=l0(f,yk-l’yk)  - Zk=l  Ej=nk_1+l0(f,Zj-l,Zj) 

— Zj=l^^f’zj-l’zj^  * 


and  hence,  Zk=l^^f ,yk-l,yk^  < “* 

5.  Proof  of  Theorem  2.  Necessity.  Let  x , < E < x , 
*-  n-1  — n — n 

n = 1,2,...,  and  set 


x 

M = f n f(t)dt,  n = 1,2, 

n 

x i 

n-1 

As  f is  of  bounded  coarse  variation,  therefore,  by 

OO 

Theorem  1,  I,  ^k  < 00 . Hence,  by  the  Lemma, 

oo  . 

Z ..0(f,x  ,,x  )(x  - x ,)  converges. 

n=i  n-i  n n n-1 

Let  e > 0.  Let  N be  an  integer  0 such  that  if 
N <_  < N£,  then 
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r*N 


2 f(t)dt|  < e/2 


and 


N2 

■ vi>  * e/2- 


Let  N <_  Nx  < N2.  Then 

No  N 

|z  K,  ,if(C  )(x  - x .)  < |z  ,,  M I 
' n=N  +1  n n n-1  1 — 1 n=N^+l  n1 

N2 

+ Z N , . |f(C  )(x  - x ) - M | 

n=N^+l  1 n n n-1  n1 


Nr 


Nr 


< | / 2 f(tMt|  + 2„,N  +1<Kf.Vrxn)(x„  - Vi>  < 

XN1  1 

00 

Hence  Z . f(£  )(x  - x ,)  converges. 

n=l  ^n  n n-1  6 

Sufficiency . We  may  clearly  assume  that  f Is  real  on 
[0,°°).  We  first  prove  that  f is  improperly  Riemann  in- 
tegrable  on  [0,“>)  . 

Let  n be  a positive  integer.  If,  for  every  x in 

[x  , , x ] , we  had 
n-1  n 

, x 

f(x)  > (x  - X ) J n f (t)dt, 
n n-i  J 

x i 
n-1 

then  it  would  follow  that 


/ n f(x)dx 


A i 

n-1 

is  larger  than  itself  [3,  Theorem  388].  Let  y satisfy 

x , < y < x , 
n-1  — n — n 


f(y 


_1  X 

) <_  min  [ { inf  f (x)  } + 2 n,  (x  - x , ) J ° f (t)dt] 

x . <x<x  v 


V 1 ' A'A 

n-1 n 


n-1 
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Similarly,  let  x . < z < x , 
n-1  — n — n 


f (z  ) > max  [[  sup  f (x)  } - 2 n,  (x  - x . ) / nf  (t)dt  ] . 

n — ^ ^ n n-i  ' 

X <x<x  X , 

n-1 n n-1 

Then  for  n = 1,2 


f(yn)(xn  - Vl)  < / n f<t)dt  i f(zn)(xn  - Vl>- 

Vl 

and  hence,  if  0 <_  < N^,  we  have 

£n-N1+l£(yn)<xn  - xn-l>  t(t)dt 

n2  ^ 

- E„-N1+l£<zn)<xn  - xn-l>- 

Let  e > 0.  Let  N*  be  an  integer  >_  0 such  that  if 
N*  1 < N2,  then 

l£n^1+l£(yn,(x„  - Vl>l  <e/9’ 

(3)  N 

|Z  M ,,f(z  ) (x  - X .)l  < e/9. 

1 n=N^+l  n n n-1  1 

Consequently  if  N*  £ < N2,  then 


^ f (t)dt  | < e/9. 


*N, 


If  k > N*,  then,  setting  = k-1,  N2  = k,  we  have,  by 


(3), 


lf(yk)(xk  ~ xk-l}  I < e/9’  lf(zk)(xk  " ^-l^  < e/9’ 

Let  N**  N*  be  an  integer  such  that  if  k > N**, 
then  2 - x^^)  < e/9. 
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If  k > N**,  then  throughout  [x^ 

f(yk}  " 2~k  1 f(x)  1 f<zk>  + 2"k‘ 
and  hence, 

I f (x) I £ |f(yk) | + |f(zk) I + 2_k, 

so  that  if  Xj^  ± s ± xfc, 
s 

|J  f(x)dx|  < (xk  - xk_1)[|  f(yk)|  + |f  Czk)  I + 2“k] 
xk-l 

< (e/9)  + (e/9)  + (e/9)  = e/3, 


and  similarly, 
x. 


f (x)dx | < e/3. 


Let 


V*  1 s < let  xki  be  the 


smallest  x,  > s 

x k 

and  xk0  the  smallest  x,  > t,  so  that  x,  , < s < x,  , 

2.  ^ 

Xk2-i  it<xk2'  N**<kilV  Then 

t Xk  -1  xk 

|J  f (x) dx | ^ |J  f(x)dx|  + |J  2 f (x)dx | 

S S Xk  -1 

t Ki  x 

+ | J f (x)dx | < (e/3)  + (e/9)  + (e/3)  < e. 


Hence  f is  improperly  Riemann  integrable  on  [0,°°) 
For  k = 1,2,..., 


0(f,Xk-l,Xk)  = Sup  f(x)  " inf  f(x>  1 f(zk) 

{k-l-X-Xk  " 


xk_lixlxk  x-  ^ <x<x. 


- f(y.)  + 2-2  k. 
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Since  I ^(£(2^)  “ fCy^)  + 2*2  < «°,  we  have 

»Xjc_-^»Xjt)  < °°.  By  the  Lennna,  E^_^0(f ,k-l,k)  < °°,  and 
hence,  by  Theorem  1,  f is  of  bounded  coarse  variation.  There- 
fore f is  simply  integrable. 

6.  We  make  here  the  following  two  observations: 

(a)  Let  f be  a complex  function  on  [0,°°)  for  which 
{x  >_  0:  f(x)  ^ 0}  is  unbounded.  Then  there  exist  sequences 

<x„Co:  0 ■ *0  ‘ *1  * *2  •">  xn  * "■  and  «„Cr 

Vl  - 5n  L xn'  n " 1>2 such  that  Cl£<5n)(xn  ‘ xn-l> 

diverges . 

(b)  Let  f be  a complex  function,  improperly  Riemann 

00 

integrable  on  [0,°°).  Then  there  exists  a sequence  (x  ) 

n n=U 

OO 

0 = xn  < x.  < x„  . . . , x -*■<»,  such  that  Z ,f(£  )(x  - x .) 

0 1 2 n n=l  n n n-1 

converges  whenever  x , < £ < x , n = 1, 2, . . . . 

n-1  — n — n 

Let  f*  be  a simply  integrable  function  for  which 
{x  >_0:f*(x)  ^ 0}  is  unbounded,  and  let  f**  be  a complex 
function,  improperly  Riemann  integrable  on  [0,°°),  but  not 
simply  integrable.  If  we  delete  (2b)  from  the  hypotheses  of 
Theorem  2,  then  f*  provides  a counterexample  to  the  necessity 
part  of  Theorem  2,  and  f**  provides  one  to  sufficiency. 

7.  The  truth  of  (a)  is  readily  seen.  We  indicate  now  the 
proof  of  (b) . Let  f^  = Ref,  f ^ = Imf.  For  N = 0,1,2,... 
choose 


N = ygN)  < . . .<y£N)  = n + l 

N 


such  that,  for  j = 1,2, 


- O < 2 


,-N-l 


If  N is  an  integer  ^ 0,  if  1 <_  p <_  qN  <_  r , and  if 
(N)  (N)  (N),  , ..  N N N 

yk_i  l i yk  for  k = pn.  pn+1 v then 


499 


CHARLES  F.  OSGOOD  AND  OVED  SHISHA 


PN  J (N) 

- §>  f^)dx 

< y^r<  2"n'1  « - 1-2>’ 

PN  J 


and  hence, 


tr*  V'FX’S0  - ' /(«?  £i<X)dX'  < 

PN  yp  -1 


(j  = 1,2), 


which  implies 

(N) 

(4)  I, CL  *Ct»)<T®  - t£)1  - * 2^ 

PS  yp„-i 


Consider  the  sequence 


n (0) 
0 = y0 


(0)  v(1)  < <v(1)  = Vn2)  < ••• 

<•  ..<Yr  - y0  <--  yr1  Y0 


and  denote  it  xQ,  x2,...so  that  0 = xQ  < *j_  < *2"- 

xn  -f  Let  Vl<5niV  " = I'2*""  We  show  that 
ES,f(C  )(*n  " Vl}  convergeS* 

Let  c > 0.  Let  nQ  > 0 be  such  that  (i)  if  n > nQ  ' 
then  xn  > Nq,  where  Nq  is  an  integer  >_  0 with  2 e/4, 

and  (ii) 


|JXt  f ( x) dx | < e/2 


. „ „ r ipf  n < s < t;  we  claim: 

whenever  nQ  _<  s < t.  Let  nQ  _ 
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Z )(x  ~ X ,) 

1  n=s+l  n n n-1 


< e. 


Indeed,  set 


x 

S = (0(x„  - x i)]  “ / C f(x)dx. 

n=s+l  n n n-1  * 

x 

s 

Then 


|!WU  £(5„)(x„  - Vi’1  - |S|  + b ' f(x)d,<l 

X 

s 

< Is  I + (c/2). 


Denote  N,  = [x  ],  so  that  2 < e/4,  and  [x_  .] 

1 s t-1 

u > 0.  Then,  using  (4),  one  can  show: 

-N  -k  -N 

|s|  < z“  2 1 < 2-2  1 < c/2. 

k=0 


Nl  + u, 


Hence 


£ ...  f (C  ) (x  - x . ) | < c . 

1 n=s+l  n n n-1  1 
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SHAPE  PRESERVING  SPLINE  INTERPOLATION 


Eli  Passow  and  John  A.  Roulier 

For  a set  of  increasing  (and/or  convex)  data,  we  can  con- 
sider the  possibility  of  finding  a spline  interpolant,  of  pre- 
assigned smoothness,  which  is  increasing  (and/or  convex).  To 
carry  out  this  investigation  we  construct  an  auxiliary  set  of 
points  and  use  the  monotonicity  and  convexity  preserving  pro- 
perties of  Bernstein  polynomials. 


1 Introduction 

Let  A = (x„  < x.  < ...  < x },  let  j < n,  and  let 
, , u i N 

n = n^)  t*ie  set  of  splines  of  degree  n and  deficiency 

n-j , with  knots  at  x.,  i = 1,2 N-l.  That  is,  f e (A)  if 

1 1 n 
f e C [xq,x^]  and  on  each  of  the  intervals  [x^_^,x^],  1 = 1, 

2,...,N,  f e P^,  the  set  of  algebraic  polynomials  of  degree 

< n. 

Let  yQ  < y2  < ...  < y^.  For  fixed  j and  n we  ask:  Does 
there  exist  an  increasing  f e Sj^(A)  satisfying  f(x^)  = y^, 
i = 0,1,..., N?  More  precisely,  what  conditions  on  the  data 
(Xi.yp  guarantee  the  existence  of  such  an  interpolant?  Simi- 
larly, let  si  = (yi-y1_1)/(x1-xi_1) , i = 1,2,...,N,  and  sup- 
pose that  < s^,  i = 2,3,...,N.  In  this  case  where  we 

say  the  data  are  convex,  what  additional  conditions  guarantee 
the  existence  of  a convex  interpolant  in  S^(A)? 

We  can  approach  this  problem  from  a slightly  different 
point  of  view.  For  fixed  monotone  or  convex  data  and  fixed 
J we  may  ask:  How  large  must  n be  to  guarantee  the  existence 

of  a monotone  or  convex  interpolant  in  S^(A)?  (For  j ■ n; 
i.e.,  polynomials  of  degree  n,  an  answer  to  this  question  in 

Preceding  page  blank 
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the  monotone  case  was  given  in  [A, 5].)  The  advantage  of  this 
approach,  which  we  adopt,  is  that  by  minimizing  n we  economize 
on  the  number  of  parameters  needed  to  obtain  a shape  preser- 
ving interpolant  of  desired  smoothness  (class  C~^). 

2 The  main  results 

We  will  state  our  results,  with  some  indication  of  proofs. 
The  following  definition  will  be  important  in  our  investiga- 
tion. 

DEFINITION  1.  Suppose  the  data  (x^y^,  i = 0,1,..., N,  are 
non-decreasing  (and/or  non-concave).  Let  x^  = x^  + c^Ax^ 
where  0 < < 1 and  Ax_L  = x±-x  ^ i = 1,2 N.  The  num- 

bers tj,  i = 1,2,...,N,  are  said  to  be  increasing  (and/or 
convex)  (c^  }-admissible  for  the  data  (x^y^  if  the  piecewise 
linear  function  L(x)  generated  by  the  points  (xQ,y0)  , (x^.t^, 
(x2» t^) , . . . , (x^, t^) , (xN,yN),  passes  through  the  points 
^xi,yi^’  * = N-l , and  is  non-decreasing  (and/or  non- 

concave). If  = a for  all  i,  then  we  say  that  the  numbers 
t^  are  q-admissible . 

LEMMA  1.  Let  k,m,n  be  positive  integers , with  m < n.  Let 
p e PRn  and  suppose  that  p^(a)  = 0,  j = 2,3,...,mk,  and 

p^^(b)  = 0,  j = 2,3,...,  (n-m)k,  where  a < b.  (If  mk  = 1 or 
(n-m)k  = 1,  then  the  corresponding  condition  is  vacuous . ) 

Then  the  tangent  lines  to  the  graph  of  p at  x = a and  x = b 
intersect  at  x = a + m(b-a)/n.  In  particular,  if  n = 2m,  then 
x = (a  + b)/2. 

For  the  sake  of  simplicity,  we  state  our  first  theorem 
for  = a,  but  a similar  result  for  general  holds. 

THEOREM  1.  Let  m,n  be  natural  numbers , with  n > 2m,  and  let 
a ■ m/n.  Then  there  exist  increasing  (and/or  convex)  ot-admiss- 
ible  numbers  for  the  data  (x^y^  if  and  only  if  for  all  k > 1 
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there  exists  f e S^n  (A)  satisfying 
f(x±)  = y±,  1 = 0,1, . . . ,N; 

fjj)(x1)  = 0,  j = 2,3 mk;  1 = 1,2,...,N-1; 

f^^(x^)  = 0,  j = 2,3,...,  (n-m)k;  1 = 1,2,...,N-1; 

f'(x)  > 0,  x e [xq^]  (and/or  f"(x)  > 0,  x e [xq.x^). 

Thus , in  particular,  f^(x^)  = 0,  j = 2,3,...,mk;  i = 1,2,..., 
N-l . 

Proof.  Suppose  { t ^ } are  a-admissible,  and  let  L(x)  be  the 
associated  piecewise  linear  function  in  Definition  1.  Let 
q^(x)  be  the  Bernstein  polynomial  of  degree  kn  of  the  re- 
striction of  L(x)  to  the  interval  [x^_^ ,x^] , i = 1,2,...,N, 
and  let  f(x)  = q^(x),  x e [x  ^,x^],  i = 1,2, ...,N.  It  fol- 
lows from  properties  of  the  Bernstein  polynomials  [1,  pp.  13, 
23]  that  f e S^(A) , f(x^)  = y^,  i = 0,1,..., N,  and  f is  in- 
creasing (and/or  convex). 

The  converse  follows  easily  from  Lemma  1. 

The  existence  of  an  a-admissible  set  is  thus  essential  to 
the  solution  of  our  problem.  The  next  result  is  an  algorithm 
which  is  useful  in  finding  such  sets. 

a-ALGORITHM.  Let  m-^  = s^,  = min{s2 « s^/ (1-a)  } , and  for 

i = 2, 3,..., N-l  let  m^  = (s^-aM^_^) / (1-a) , = min{s^+^, 

(si-ami_1)/(l-a) }. 

THEOREM  2.  For  non-decreasing,  non-concave  data  (x^,y^) , 
i = 0,1, ...,N,  there  exist  increasing  and  convex  q-admissible 
numbers  { t } i_f  and  only  if  the  q-algorithm  can  be  carried 
out  with  m^  < si+^*  for  i = 2,3, . .. ,N-1. 

As  a corollary  to  Theorem  2 we  have  the  following,  which 
is  a generalization  of  a result  for  quadratic  splines  in  [3]. 
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THEOREM  3.  If  0 < < . . . < sN  (81+1"2s1+s1_1  > 0,  i - 2, 

3, . . . ,N-1)  , then  there  exists  a_  set  of  increasing  (convex) 
-admissible  points  for  (x^.y^).  Hence,  in  this  case,  for  all 
n,  there  exists  an  increasing  (convex)  interpolant  in  Sj^A) . 

If  we  restrict  our  attention  to  monotone  interpolation, 
it  has  been  shown  [2]  that  for  fixed  j we  may  always  choose 
n = 2j+l.  For  convex  interpolation,  however,  no  such  result 
is  possible,  because  of  the  following. 

THEOREM  A . For  any  integer  n > 1 , there  exists  £ convex  set 
of  data  points  (x^,y^),  0 < i < 4,  for  which  no  f e S^(A) 
satisfies  f(x^)  = y^,  0 < i < 4,  with  f convex  on  [xq,x^]. 

The  next  theorem  extends  the  result  of  [2]  to  s"  , but 

zn 

additional  knots  must  be  allowed  . The  proof  is  an  application 
of  Theorem  1. 

THEOREM  5.  Let  y^  < y^  x±  = (x^j+x^/2,  y±  = (yjL_1+yi)/2, 
i = 1,2, ... ,N,  and  let  A = {xq,x^,x^,X2» . . . ,x^,x^}.  Then,  for 
all  n > 1,  there  exists  an  increasing  f e S^tA)  such  that 
f(xA)  ■ i = 0,1,...  ,N,  and  f(xi)  - y^  i = 1,2,  ...,N. 

Theorem  5 holds  for  piecewise  monotone  interpolation  as 
well  (see  [2]). 


3 Conclusion 

In  closing  we  point  out  that  since  Theorem  1 yields  no 
better  than  f e S^,  any  attempt  to  decrease  the  deficiency 
of  the  interpolating  spline  will  require  a new  approach. 
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Eli  Passow  and  John  A.  Roulier 

In  this  paper  we  show  that  there  exist  functions 
f t C [ — 1 , +1]  with  all  r-th  order  divided  differences  uni- 
formly bounded  away  from  zero  for  r fixed  for  which  infi- 
nitely many  of  the  polynomials  of  best  approximation  to  f 
do  not  have  nonnegative  r-th  derivatives  on  [-1,  +1]. 

1 Introduction 

In  [1J  - [5]  there  appear  many  examples  of  functions  f 
in  C[a,b]  with  nonnegative  r-th  divided  differences  there 
for  which  infinitely  many  of  the  polynomials  of  best  approxi- 
mation to  f fail  to  have  nonnegative  r-th  derivatives.  None 
of  these  examples  has  the  r-th  divided  differences  uniformly 
bounded  away  from  zero.  In  [7]  Roulier  shows  that  if 
f t C ^ r l - 1 , +1]  and  if  f^r\x)  >_  6 > 0 on  [-1,  1]  then 
for  n sufficiently  large  the  polynomial  of  best  approxima- 
tion of  degree  less  than  or  equal  to  n has  a positive  r-th 
derivative  on  [-1,  +1]. 

On  the  other  hand  for  the  case  r = 1 Roulier  in  [6] 
siiows  that  first  divided  differences  of  f uniformly  bounded 
away  from  zero  is  not  sufficient  to  insure  that  for  n 
sufficiently  large  the  polynomial  of  best  approximation  to  f 
is  increasing  on  [-1,  1]. 

In  this  paper  we  extend  the  results  of  [6]  to  the  case 
when  r _>  1.  The  proofs  are  similar  to  those  in  [6]  but  make 
use  of  higher  order  divided  differences  and  their  properties. 
They  will  appear  elsewhere. 

Preceding  page  blank 
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2 Main  Theorems 

The  following  theorems  treat  the  situations  where 
f[x_,...,x  , n ] > 6 > 0 for  -1  < x-  <...<  x _ < 1 and 

0 r+1  _y  — 0 r+1  — 

either  f e Cr  [-1,  +1]  or  f e Cr[-1,  +1].  The  cases 

f e [ — 1 , +1]  j = r + l,...,2r  - 1 are  unsolved.  As  men- 

2r 

tioned  above,  the  case  f £ C [-1,  +1]  has  been  solved  [7]. 
In  what  follows  will  denote  the  polynomial  of  best 

approximation  to  f on  [-1,  +1]. 

THEOREM  1.  Let  f E C[-l,  1]  have  bounded  r-th  order  divided 

differences  (if  f e Cr[-1,  +1],  then  this  happens)  and  non-  - 
s t 

negative  r + 1 order  divided  differences  on  [-1,  +1] . 
Assume  that  there  is  no  C > 0 for  which 

E (f)  < C/(n  + 1)  for  n = 0,1 

n — 

Then  there  are  infinitely  many  n for  which  we  do  not  have 
p^r+1^(x)  ^ 0 ^n  [-1,  +1]. 

COROLLARY  2(a).  Let  f e Cr[-1,  +1]  and  assume  that  f has 
s t 

nonnegative  r+1  order  divided  differences  on  [-1,  +1] . 
Define  g(t)  = f(cost).  Assume  that 

(1)  lim  sup  k u +2.(g,  k = +°°- 

k -+•  “> 

Then  there  are  infinitely  many  n for  which  we  do  not  have 
f/r+1)(x)>°  on  [-1,  +1], 

COROLLARY  2(b).  If  f has  nonnegative  (r  + l)-st  order 
divided  differences  on  (-1  - e,  1 + e)  for  some  e > 0 and 
if  there  is  no  C > 0 f or  which 

r-f- 1 

En(f)  C/(n  + 1)  for  n = 0,  1,... 
then  there  are  infinitely  many  n for  which  we  do  not  have 

p(r+l)(x)  q +1], 
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ThEOREM  3.  Let  f e Cr  ^[-1,  +lj  and  assume  that  f has 
s t 

nonnegative  r + 1 order  divided  differences.  Assume  that 
there  is  no  C > 0 for  which 

E (f)  < C/(n  + l)r  for  n = 0,  1,...  . 

n — 

Then  there  are  infinitely  many  n for  which  we  do  not  have 
p^r+^  (x)  ■>  0 on  [-1,  +1], 

COROLLARY  4.  Let  f e Cr  ^[-1,  +1]  and  assume  that  f has 
s t 

nonnegative  r + 1 order  divided  differences . Define 
g(t)  = f (cos  t). 


Assume  that 

(2)  lim  sup  k ui  (g,  — )/log  k = +°°. 

i r k. 

k ->•  * 

Then  there  are  infinitely  many  n for  which  we  do  not  have 

p(r+l)(x)  >0  on  [-1,  +1]. 
n — — 

THEOREM  5.  Given  integer  r 0 and  modulus  of  continuity 
w there  exists  f e C [-1,  +1]  with 

(3)  f[xQ Xr+]_J  i 6 > 0 for  a11  x0  <•••<  xr+1 

in  [-1,  +1]  and  with 

(4)  u(h)  < co(f (r),  h)  <_  Kco(h) 


and  yet  there  are  inf initely  many 
p(r+1)(x)  > 0. 


n 


THEOREM  6.  Given  integer  r _>  1 

r ” 1 

u>  there  exists  f c C [-1,  +1] 


(5)  f [xQ, . . . ,xr+1J  _>  6 > 0 for 
in  [-1,  +1]  and  with 

w(h)  w(f (r  h)  <_  Km(h) 


n for  which  we  do  not  have 


and  modulus  of  continuity 
with 

all  x0  <...<  xr+1 
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and  yet  there  are  infinitely  many  n for  which  we  d£  not  have 

p(r+1)(x)  > 0. 
n — 
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T.  J.  Rivlin 

The  problem  considered  is  to  assign  a measure  of  circu- 
larity to  a given  compact  set  in  the  plane.  The  measure 
adopted  is  the  size  of  the  smallest  annulus  containing  the 
given  set.  Two  different  notions  of  the  size  of  an  annulus, 
that  of  area  and  that  of  difference  of  radii,  are  studied. 

1 . Introduction 

The  problem  we  consider  is  to  assign  a measure  of  cir- 
cularity to  a given  compact  set  in  the  plane.  Our  approach  is 
to  determine  the  best  annulus  which  contains  the  given  set  and 
rate  the  circularity  of  the  given  set  according  to  the  size  of 
the  annulus.  Of  course,  we  must  make  precise  the  sense  of 
"best"  and  "size"  in  the  preceding  sentence.  We  consider  two 
different  assignments  of  size  to  an  annulus.  The  first  is  its 
area,  the  second  the  difference  of  its  radii.  A notion  of 
size  having  been  fixed  we  define  a best  annular  approximation 
to  a given  compact  set  in  the  plane,  S,  to  be  an  annulus  of 
least  size  which  contains  S. 

It  turns  out  that  with  the  criterion  of  size  correspond- 
ing to  area  the  problem  of  best  annular  approximation  of  S is 
equivalent  to  a well-known  linear  uniform  approximation 
problem.  This  is  discussed  in  Section  2.  The  second  notion 
of  size  is  equivalent  to  a somewhat  novel  non-linear  approxi- 
mation problem  as  we  shall  see  in  Section  3. 

What  follows  is  a summary  of  selected  results  in  this 
problem  area.  A full  report  including  all  details  will  be 
published  elsewhere.  The  author  wishes  to  thank  his  colleagues 
A.  J.  Hoffman  and  C.  A.  Micchelli  for  valuable  suggestions. 
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2 . Least  area  criterion 

Given  a compact  set  S in  the  plane,  for  any  complex  num- 
ber, w,  put 

(1)  r2(w)  «=  max|z-w|  ; r..  (w)  = min|z-w|. 

zeS  zeS 

Clearly  S is  contained  in  the  annulus 

(2)  r^ (w)  < | z— w | < r2(w) 

2 2 

whose  area  is  A(w)  = ir(r2  -r^  ). 

If 

inf  A(w)  - A(w  ) 
w 


then  the  annulus  described  by  (2)  with  w = w is  a best  annulus 

o 

in  the  sense  of  area  which  contains  S. 

2 2 

Now  put  f(x,y)  - x +y  and  let  V be  the  linear  space  of 
linear  functions  ax+by+c.  f(x,y)  has  a best  uniform  approxi- 
mation out  of  V on  S. 


THEOREM  1.  If  w :(x  ,y  ) is  the  center  of  a best  annulus  in 
— o o o 

the  sense  of  area  for  S,  then 


VQ(x,y) 


2x  x+2y  y-(x  +y  - 
o 1 oJ  o o 


rl2(wo)+r22(wo) 


is  a best  approximation  to  f(x,y)  on  S out  of  V,  and 

M=  llf-v0ll  * (*2  (Wo)-1-!  (w0))/2*  Conversely,  if 

VQ(x,y)  2xQX+2yoy+co  is  a best  uniform  approximation  to 

f(x,y)  on  S out  of  V then  w : (x  ,y  ) is  the  center  of  a best 
— o o o 3 

annulus  for  S,  and  if  M - | | f-v  | | .then  r„2(w  ) = c +x  2+v  2+M: 
o'  2 o o o ■'o  * 

2 2 2 2 2 
rl  (wo)  = c0+x0  +yQ  -M,  and  hence  r2  (wt,)-r1  (WQ)  “ 2M- 

This  equivalence  establishes  the  existence  of  a best 
annular  approximation  and  sheds  light  on  uniqueness  questions. 
The  characterization  of  best  linear  approximation  here  is 
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elegant  and  simple  in  terms  of  the  notion  of  extremal  signature 
(cf.  Rivlin  and  Shapiro,  SIAM  Journal,  J?,  1961,  670-699).  It 
enables  us,  for  example,  to  determine  best  annular  approxima- 
tions to  sets  S consisting  of  a finite  number  of  points,  and  to 
establish  the  following  geometric  fact.  If  S is  an  oval  then 
there  is  a pair  of  concentric  circles  of  which  the  inner  one 
lies  inside  the  oval  and  touches  it  at  at  least  two  distinct 
points,  and  the  outer  one  lies  outside  the  oval  and  touches  it 
at  at  least  two  distinct  points.  The  question  of  what  is  the 
most  general  class  of  simple  closed  curves  having  this  prop- 
erty is  open  and  seems  difficult. 

3 . Difference  of  radii  criterion 

In  this  section  we  retain  the  notation  of  Section  2,  but 
here  instead  of  A(w)  we  are  interested  in  r(w)  = ^ (w) -r  ^ (w)  . 
If 

inf  r(w)  = r(w  ) 
o 
w 

we  call  the  annulus  described  by  (2)  with  w = wq  a best  ap- 
proximation in  the  uniform  sense  to  S. 

We  note  at  once  that  a best  uniform  annular  approximation 
need  not  exist.  For  example,  it  does  not  exist  if  S consists 
of  3 or  more  collinear  points.  However,  a uniform  approxima- 
tion problem  equivalent  to  the  problem  of  best  uniform  annular 
approximation  does  exist. 

Consider 

(3)  F(x,y;  h,k,t)  - ((x-h^+Cy-k)2)1^2  -t. 

Then  the  following  equivalence  is  quite  obvious. 

LEMMA  1. 

V - min  max  | F(x,y;h,k,t) | - max  |F(x,y;  h,k,t)| 
h,k,t  (x,y) eS  (x,y)cS 

if,  and  only  if,  the  annulus  centered  at  w: (h,k)  with 
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r^(w)  = t-y  and  r^(w)  = t+y  is  a best  uniform  annular 
approximation  to  S. 

Thus  the  equivalent  non-linear  approximation  problem  re- 
ferred to  is  that  of  finding  a best  approximation  on  S to  zero 
by  means  of  functions  of  the  form  (3).  A familiar  kind  of 
argument  now  yields  the  following  necessary  condition. 

THEOREM  2.  1^  w:  (h,k)  is  the  center  of  an  annulus  which  is  a 

best  uniform  approximation  to  S,  with  0 < r^(w)  < r^ (w) , then 
there  exist  a positive  Integer  k < 4,  positive  numbers 
A^ , . . . , A^,  points  z^ , . . . , z^  of  S and  disjoint  subsets  1^  and 

I2  of  {l,...,k}  such  that  l^l  + lljJ  = k, 

k 

n ■ 1, 

J-i  J 


L A 


J*I, 


j’ 


and 


I A 


z -w 


Z A, 


!iw 


j€ X1  ^ r1(w)  jel2  ^ r2(w) 


where  |zj-w|  - ^ for  and  |z  -w|  =*  r2  _£er  jcl2. 

In  certain  simple  examples  these  conditons  give  a good 
deal  of  information  about  best  approximations.  For  instance, 
when  S consists  of  4 points  a best  uniform  annulus  for  S has 
2 points  of  S on  its  outer  boundary  and  2 on  its  inner  bound- 
ary; thus,  in  view  of  the  results  mentioned  in  Section  2,  it 
cannot  coincide  with  a best  annulus  in  the  sense  of  area  in 
many  cases. 
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Sufficient  conditions  for  best  uniform  annular  approxi- 
mation seem  hard  to  come  by  and  are  under  investigation. 
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THE  BEHAVIOR  OF  THE  PADE  TABLE 
FOR  THE  EXPONENTIAL 

E.B.  Saff  and  R.S.  Varga 

In  this  paper  we  survey  recent  results  and  present  some 
new  theorems  on  the  behavior  of  Pade  approximants  for  e~z 
The  new  results  include  necessary  and  sufficient  conditions  for 
(1)  a sequence  of  approximants  to  be  pole-free  in  an  infinite 
sector,  and  (2)  a sequence  of  approximants  to  converge  geomet- 
rically in  the  uniform  norm  over  an  infinite  sector. 

1 Introduction 

While  the  study  of  the  Pade  table  for  the  exponential 
function  dates  back  to  Pade's  thesis,  there  has  been  renewed 
interest  in  the  subject  because  of  its  usefulness  in  certain 
numerical  schemes  for  solving  parabolic  differential  equations. 
Several  recent  papers  have  appeared  which  consider  the  ques- 
tions of  location  of  zeros  and  poles,  regions  of  convergence, 
and  degree  of  convergence  of  sequences  from  the  table  (see  [3] , 
[9J  » (lO J * [I4], [l6]  ) • The  purpose  of  the  present  paper  is  to 
survey  some  of  these  results  and  also  to  establish  some  new 
theorems.  In  this  first  section  we  introduce  the  necessary  no- 
tation, in  Sec.  2 we  discuss  zero  and  pole-free  regions,  and 
in  Sec.  3 we  consider  the  degree  of  convergence  of  Pade  approx- 
imants in  unbounded  regions. 

To  be  specific  we  shall  deal  with  the  complex  negative  ex- 
_2 

ponential  function  e . For  each  pair  (v,n)  of  nonnega- 
tive integers  the  Pad£  approx lmant  R (z)  of  type  (v,n)  for 
— z 

e is  defined  as  that  unique  rational  function  with  numerator 
degree  v , denominator  degree  n , which  has  greatest  con- 

— 2 

tact  with  e at  the  origin,  l.e., 

Preceding  page  blank 
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(1.1)  e Z - R (*) 

v,n 


. n+v+1 . _ 

0(z  ) as  z-*-0 


Explicitly  it  is  known  that  R (z)  = Q (z)/P  (z)  , 

* J • m n \)  n v.n 


v,n 


v,n 


where 


M n (y't  = ? (n+v-j)  !v!  (-z)j 

(1'2)  Qv,n(z)  / (n-K>) ! j ! (v-j ) ! 


and 

(1.3) 


P (z) 
v,n 


j'o 
n 

l 

j=o 


n (n+v-j ) In ! z^ 
(n+v) !j ! (n-j)  ! 


The  polynomials  Q (z)  and  P (z)  are  referred  to  re- 
v 1 xv,n  v,n 

spectively  as  the  Pade  numerator  and  Pade  denominator  of  type 


(v,n)  for  e 


-z 


From  the  representations  (1.2)  and  (1.3) 


it  is  apparent  that  n(z) 


P (-z)  , and  so  any  result  on 

n,v  * 

“Z 

the  location  of  the  poles  of  Pad£  approximants  to  e has  a 


reformulation  in  terms  of  zeros. 

The  Pade  approximants  R^  ^(z)  are  usually  studied  in  the 
context  of  the  following  doubly  infinite  array  known  as  the 
Pade  table: 


(1.4) 


0,0 

R1,0 

R2,0  ’ * * 

0,1 

Rl,l 

R2,l  * ‘ * 

0,2 

Rl,2 

R2,2  * ‘ ’ 

• 

• 

• 

• 

• 

• 

Notice  that  the  first  row  of  the  table  consists  of  the  partial 

sums  R (z)  • ^ (-z)k/k!  of  e Z and  that  the  first  col- 

v,°  k*o 

umn  is  composed  of  the  reciprocals  of  the  partial  sums  for  the 

n k _2 

positive  exponential,  i.e.,  R (z)  - [ £ z /kll 

0 .11  , 

k«o 
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2 Unbounded  pole-free  regions 

The  asymptotic  behavior  of  the  poles  of  the  first  column 

r k 

of  (1.4),  i.e.,  the  zeros  of  the  partial  sums  s (z)  = l z /k!  , 

^ ^ kc0 

was  studied  by  Szego  £l3^  and  Dieudonne  As  a consequence 

of  their  results  it  follows  that  any  infinite  sector  with  ver- 
tex at  the  origin  contains  infinitely  many  poles  of  the  se- 
00 

quence  (R  (z)}  . By  way  of  contrast  it  is  shown  in 

o,n  n*=o  J 

R.S.  Varga's  thesis  [17}  that  the  infinite  half-strip  |y|^/6  , 

00 

x>0  , is  free  of  poles  of  the  v'hole  sequence  (R  (z)}  . 

o j n n*=  o 

More  recently,  Newman  and  Rivlin  C (A  3 » & ^ ) established  that 
there  exists  an  unbounded  parabolic  region,  namely 

(2.1)  y2  dx  , x >_  0 , d • 0.745  , 

00 

which  is  pole-free  for  the  sequence  (R  (z)  } . Further- 

o,n  n=o 

more  they  proved  that  for  this  sequence  parabolic  growth  char- 
acterizes the  largest  pole-free  region  symmetric  about  the  pos- 
itive real  axis. 

Using  continued  fraction  techniques  the  authors  were  able 
to  improve  upon  the  result  of  (2.1)  and  also  to  obtain  similar 
results  for  all  the  columns  of  the  table  (1.4).  In  stating 
this  theorem  it  is  convenient  to  introduce  the  normalized  Pade 
approxlmants  R^  n((v+l)z) 

THEOREM  2.1.  (Saff,  Varga  [83, [ll})*  For  all  v^O  , n»0  , the 
normalized  Pade  approxlmant  R^  ^((v+l)z)  has  no  poles  in  the 
unbounded  parabolic  region 

(2.2)  P^-fz  * x + iy  : y2  4(x  + 1)  , x > -1} 

Moreover,  every  boundary  point  of  is  a limit  point  of 

poles  of  the  collection  (R  ((v+l)z)}°° 
v,n  v-o,n-=o 

In  particular,  Theorem  2.1  implies  that  the  first  column 
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of  the  table  (1.4),  for  which  v-0  , is  pole-free  in  P^  (a 

region  larger  than  that  of  (2.1))  and,  in  general,  the  (v-H)st 
00 

column  (R  (z)}  is  pole-free  in  the  parabolic  region 
v,n  n-o 

(2.3)  Pv+1:“  {z“x+iy  : y2  ± 4(v+l)  (x+v+1)  , x > -(v+1)}  . 


These  facts  have  proved  useful  in  approximation  estimates  for 
the  matrix  exponential  as  discussed  in  a recent  paper  of 
Van  Loan  [l5]  . 

While  Theorem  2.1  is  sharp,  it  does  not  include  the  fact 
that  for  an  arbitrary  fixed  v the  largest  pole-free  region 
for  the  sequence 
prove  this  in 


(R  (z) } has  parabolic  growth.  We  shall 
v,n  n-o  r e 


THEOREM  2.2.  For  each  fixed  v>0  , the  Pade  approximant 

R^  ^(z)  for  e has  ji  pole  of  the  form 

(2.4)  (nWn  x ) + 1*^7  y , where  x + iy  -*v  (fO) 

v,n  'v,n  v,n  ■'v.n  v 

as  n-*» 


Note  that  as 
2 


ny 


v,n 


lim 
n-*»  n+i^n  x 


(Im  w^)‘ 


v,n 

there  are  poles  of  the  R 

2 v,“  ? 

the  parabolic  arc  y -(Im  w ) x , as  n-*-> 


(z)  which  asymptotically  fall  on 
When  v-0  , 

Theorem  2.2  reduces  to  the  known  result  of  Newman  and  Rivlin 
[4].  The  proof  of  Theorem  2.2  requires  the  following  lemma : 


LEMMA  2.1.  For  each  nonnegative  Integer  v , the  function 

(2.5)  F (*):-  P°  tVe-Zt  ^2dt  , (0<t«*>)  , 

v Jo  — 

Is  an  entire  function  having  at  least  one  (finite)  zero  w (**0). 

Proof.  It  is  easy  to  see  that  F^z)  is  entire.  More  pre- 
cisely, on  writing 
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(2.6)  F(z)  - l a,(v)zk  , 

k«o 

it  follows  from  (2.5)  that 

.k-fv-1. 

(-Dk  2 2 rr^ii) 

(2.7)  afc(v)  - , k>0  . 

Using  Stirling's  formula  one  can  verify  from  (2.7)  that  F (z) 

v 

is  of  order  2 for  each  v and,  moreover,  F (z)  is  an  en- 
tire function  of  perfectly  regular  growth;  specifically  if 
M_  (r):=  max  If  (z) I , then 

r II  V 

v |z|-r 

In  Mp  (r) 

(2.8)  lim  j . 

r-K»  r 

Now  assume  to  the  contrary  that  F (z)  has  no  zeros.  By 
the  Hadamard  Factorization  Theorem  ([l,p.22]),  we  can  express 
F^(z)  as  F (z)“e^Z^  , where  q(z)  is  a polynomial  of  de- 
gree not  exceeding  the  order  of  F^  . Hence,  since  F is  of 
order  2 , there  exist  constants  a-pO^  such  that 

2 2 
a..z+a9z  a1z+n9z 

(2.9)  F (z)  - F (0)e  1 = a (v)e  , for  all  z . 

V V o 

Using  (2.7),  (2.8),  and  the  fact  that  (r)  = F (-r)  , it  is 

easy  to  show  that  v 

«2-i  , - -/I  r(^)/r(^±i)  , 

and  hence  the  right-hand  member  of  (2.9)  is  completely  speci- 

2 

fied.  Equating  the  coefficients  of  z in  (2.9)  results  in 
the  equation 

v+1  v— 1 
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which  after  some  minor  manipulations  becomes 

(2.10)  v(r(^±i))2  = 2(r(^p))2  . 

But  this  equality  must  fail  for  every  v>0  . Indeed,  if  v=0, 

the  left  side  of  (2.10)  vanishes,  while  the  right  side  is  2 . 

If  v is  positive,  one  side  of  (2.10)  is  an  integer,  while  the 

other  side  is  a rational  multiple  of  tt  . Thus  the  assumption 

that  F has  no  zeros  yields  a contradiction,  and  Lemma  2.1 
v J 

is  proved.  | 

We  can  now  give  the 

Proof  of  Theorem  2.2.  Since  for  each  pair  (v,n)  , the  Pade 

numerator  Q (z)  and  Pade  denominator  P (z)  have  no  com- 
v,n  v,n 

mon  factors,  it  suffices  to  show  that  for  each  fixed  v , the 
polynomials  P^  ^(z)  have  zeros  of  the  form  (2. A).  Using  the 
representation  (1.3)  the  following  integral  formula  can  be 
derived : 

r 

(2.11)  (n+v)!  P (z)  * e t(t+z)ntVdt  , (0<t<+«) 

,n  J0 

Letting  z = n + Sn  w and  making  the  change  of  variables 
t=/n  u , 0<u<+“  , in  (2.11)  we  find  that 


(2.12)  (n+v)!  P (n+i/n  w) 

v,n 


2n+v+l 

2 


n 


.to 

O 


-/n  u , w+u ,n 
e {1  + } 


uvdu . 


The  logarithm  of  the  integrand  above  is,  for  u and  w fixed 
and  n large, 

2 2 1 
(i^n  w - —)  - wu  - y~  + vln  u + 0(— ) , 

and  so 

2 2 

-/rTu  , w+w.n  v,  /n  w-w  /2  v -wu-u  /2 
lira  e {1  + } u /e  = u e 

n-x»  Sn 

Now  the  proof  given  by  Newman  and  Rivlin  [4]  can  be  adapted  here 
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to  show,  using  the  Lebesgue  Dominated  Convergence  Theorem,  that 


(2.13) 


lim 

n-+» 


(n+v)  ! P (n+i^n  w) 
v,n  

- 2n+v+l 

v^n  w-w  /2  2 

e n 


.00 


v -wu-u 
u e 


-:Fu(w) 


> 


the  convergence  being  uniform  on  compact  subsets  of  the  w- 

plane.  Since,  by  Lemma  2.1,  F^(w)(£0)  has  a finite  zero,  say 

, Hurwitz's  Theorem  implies  that  n(n+*^T  w)  possesses  a 

zero,  say  w=w  . such  that  w -+w  as  n-**0  . This  means 

v,n  * v,n  v 

P (z)  has  a zero  of  the  form  (2.4).  I 
v , n 

Concerning  pole-free  sectors  for  the  Pade  approximants 

R (z)  the  following  is  known: 
v 

THEOREM  2.3  (Saff,  Varga  C 93 , Qll]).  For  every  v>0  , n>2  , 

the  Pade  approx imant  Ry  n(z)  for  e z has  no  poles  in  the 
infinite  sector 

(2.14)  Sv  n:**{z:|arg  z|  <_  cos~1(n~^~2-) } . 

Furthermore,  for  any  fixed  a , 0<o<+co  , each  element  in  the 

oo 

sequence  of  approximants  {R^  ^ (z)}^^  satisfying 

v +i 

(2.15)  lim  n -+®  , lim  v./n.=a  , and  ( J 1 ) >_  a , 

j*.  J J - V 

for  all  j>l  , ij^  pole-free  in  the  infinite  sector 

(2.16)  Sa:={z:|arg  z|  <_  cos  ^-j^-)}  » 


and  i£  the  largest  sector  of  the  form  |arg  z | <_  y , p>0, 

which  is  devoid  of  all  poles  of  any  sequence  of  approximants 


{R  (z) },  , satisfying  (2.15). 

V*j  J_1  

In  particular,  for  any  (fixed)  o>0 


is  the  largest 


pole-free  sector  of  the  form  arg  z | <_  p , y>0  , for  the 
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sequence  (ft [on]  n^z^nmi  » where  CO  denotes  the  greatest 
integer  function.  This  fact  has  an  interesting  geometric  in- 
terpretation as  explained  in  C9]. 

Using  Theorems  2.2,  2.3,  and  the  results  in  [11].  we  can 
deduce  the  following  new  result  *. 

THEOREM  2.4.  A necessary  and  sufficient  condit ion  that  £ se- 
quence  of  Pade  approximants  {R^  n » with  n^-**  * 


y>0  , is  that 


{R  (z)}. 

Vk,nk  k 1 

pole-free  in  some  infinite  sector  | arg  z | < 

(2.17)  lim  inf  > 0 

Proof.  The  sufficiency  part  follows  immediately  from  Theorem 
2.3.  To  prove  necessity  assume  that  (v^*^)  Is  a sequence 
such  that  ant*  11®  inf^)oo  v^/n^O  • ®ur  a*m  1®  to  show 

that  for  every  y>0  , there  are  infinitely  many  poles  of  the 

sequence  tK-t  _ UHW1  in  the  sector  |arg  z|  < y . For 


(R  (z)}.  n 

\*nk  k_1 


this  purpose  let  {(v.  ,nj))j^ 


denote  a subsequence  of 


{(v.  ,n.)},  . for  which  11m  v./n  =0  . We  consider  two  separate 
k k.  k-i  ] 3 

cases: 


Case  1:  If  some  subsequence  of  {v.}.  . is  bounded,  then 

J ^ oo  00 

there  is  evidently  a subsequence  { (v^jti^)  of  {(v^.n^)}^^ 
for  which  is  constant,  say  v^=v  for  all  t> 1 , and  for 


But  as  a consequence  of  Theorem  2.2,  the 


which  11m  n»=  « 

b+”°  * 

sequence  {R  (z)}»  - has  poles  which  asymptotically  (as 

V | 4Lm  X 

n^-H*)  lie  on  some  parabola  opening  about  the  positive  real 
axis.  Therefore  the  sequence  has  infinitely  many  poles  in  any 
sector  of  the  form  ]arg  z|  < y , y>0 


Case  2: 


If 


as 


and  lim  v./n  =0  , 

j-*»  J J 


then  the 


result  of  Corollary  3.1  of  tu]  applied  to  the  sequence 
{P  (z)}  of  Pade  denominators  for  e z again  shows  that 

Vj,nj 
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there  is  no  pole-free  sector  of  the  form  |arg  z|  < y , y>0  , 
for  the  sequence  {R  (z)},  - . I 

Vnj  J 

The  next  theorem  has  application  to  stability  questions 
and  extends  results  in  £3]  and  [18} . 

THEOREM  2.5  (Saff,  Varga  [9]) . lf_  n<v+4  , the  ?adi  approxi- 

mant  R^  R(z)  for  e z has  all  its  poles  in  the  open  left 
half-plane. 

The  above  theorem  is  sharp  in  the  sense  that  the  approxi- 

mant  R c(z)  , for  which  n=v+5  , does  in  fact  have  a pole 
o,5 

in  the  right  half-plane.  However  the  following  assertion  can 

be  made  with  regard  to  diagonal  sequences  of  the  table  (1.4)  of 

the  form  {R  (z)}  , t>5: 

n-T,n  n“T  — 

THEOREM  2.6  (Saff,  Varga  [9]).  For  any  integer  t>5  , there 

exists  an  Integer  m-m(T)  such  that  the  approximants 

{R  (z)}*  have  all  their  poles  in  the  open  left  half- 
n-T,n  n-m r c 

plane . 

3 Geometric  convergence  of  Pade  approximants 
in  unbounded  regions 

In  this  section  we  discuss  results  concerning  geometric 
convergence  of  Pade  approximants  on  the  nonnegative  ray 
(b,+“)  , and  on  infinite  sectors  of  the  form  |arg  z | <_ y , 

y>0  . First  we  set 

(3'1)  ' 

Notice  that  when  v>n  , we  have  n “le  -R  (®)  | **  00 

v,n  v,n 

When  v<n  the  following  estimates  are  known: 

THEOREM  3.1  (Saff,  Varga,  Ni  [12} ) . For  any  nonnegative  inte- 
gers v and  n with  0^v<n  , there  holds 
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(3.2) 

where 


2n~V0  (n+1 ) 2 nv»n  2n“V(") 
y jL£  ji  positive  constant  independent  of 


v and  n 


To  state  the  next  theorem  we  need  the  function  g(B)  de- 
fined for  0<j3^1  by 

B 1-6 

(3.3)  g(B)  :■  , 0<B<1  , g(0):=l/2  , g(l):=l  . 

2i-e 

THEOREM  3.2  (Saff,  Varga,  Ni  [l2]).  Let  {v(n)}"=1  be  a s£- 

quence  of  nonnegative  integers  with  0<v(n)<n  for  all  n , 

and  satisfying  lim  v(n)/n  * B . Then 
n-**> 


(3. A) 


lim  n 

n-**> 


1/n 

v(n),n 


= g(6) 


As  min  g(B)  = g (1/ 3)  = 
°<Bll 

theorem  that  for  any  sequence 


lim  inf 
n-H*> 


1/n  > I 

nv(n) ,n  — 3 


1/3  , it  follows  from  the  above 

on 

(v(n)}  , , there  holds 

n=l 


(Rr  /VI  (x)}  . 

Ln/3J,n  n=l 


with  equality  possible  for  the  sequence 
Indeed  numerical  computations  appear  to  indicate  that  for  each 

v=0,l,2,...,  occurs  when 


fixed  n the  smallest  error  n 
v=[n/3] 


v,n 


Another  consequence  of  Theorem  3.2  is  stated  in 


THEOREM  3.3  (Saff,  Varga,  Ni  fl2] ) . A necessary  and  sufficient 

00 

condition  that  a sequence  of  Pade  approximants  n^X^n=l 

converges  geometrically  in  the  uniform  norm  to  e-x  on  [0,+®) 
is  that 

(3.5)  lim  sup  < 1 . 

IFKO 

Concerning  geometric  convergence  in  the  uniform  norm  over 
infinite  sectors  we  shall  prove  the  following  new  result: 
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THEOREM  3.4.  A necessary  and  sufficient  condition  that  £ se- 
quence of  Pade  approx imants  ^v(n)  n^z^n=l  conver%e8  geo- 

— z 

metrically  In  the  uniform  norm  to  e _in  some  inf inlte  sector 
S^:»{z:|arg  z | <_  p } , p>0  , Is  that 

(3.6)  0 < lim  inf  < lim  sup  - < 1 

n — r n 

n-M»  n-H*> 

Proof.  That  condition  (3.6)  is  sufficient  to  ensure  geometric 
convergence  in  some  , p>0  , is  proved  in  [12].  To  demon- 

strate necessity  we  assume  that  for  some  p>0  , the  sequence 

(Rv(n),n(z),n-l  “ti8fles 

(3.7)  11m  sup  ||e‘z-R  < 1 . 

n-H»  ’ <»  p 

Since  contains  the  ray  (0,+»)  , it  follows  from  Theorem 

3.3  that  lim  sup  v(n)/n  < 1 . Furthermore, as  (3.7)  evidently 

n-M» 

implies  that  for  n large  enough,  the  poles  of  the  sequence 

{R  , . (z)}  must  omit  the  sector  S , Theorem  2.4  implies 

v (n)  ,n  P v 

lim  inf  v(n)/n  >0  .1 

n-H*> 

Concerning  estimates  for  the  size  of  the  sector  S of 

P 

geometric  convergence  for  a sequence  satisfying  (3.6),  the 

reader  is  referred  to  We  remark  that  although  no  column 

■>2 

of  the  table  (1.4)  converges  geometrically  to  e in  an  in- 
finite sector,  each  column  does,  in  fact,  converge  geometri- 
cally to  e z on  an  unbounded  parabolic  region  (see  [10J). 

Of  course  the  poles  of  the  Pade  approximants  to  e are, 
in  general,  not  all  real.  For  computational  purposes  it  is 
sometimes  desirable  to  deal  with  rational  approximations  whose 
poles  are  all  real  and  coincident.  In  VJ  it  is  shown  that 
there  exists  a sequence  of  rational  functions  of  the  form 
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rn(x) 


such  that 


■Vi00 
u+  2)” 


deg  Pn-1  - n_1  » n-1.2.***. 


I'"*  ' rn(l‘)lll>,[0,+.)  ' as  • 


Some  further  properties  of  this  sequence  are  discussed  in  VI- 
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CALCULATION  OF  BEST  APPROXIMATIONS  BY  RATIONAL  SPLINES 


R.  Schaback 

The  purpose  of  this  paper  is  to  construct  best  Chebyshev 
approximations  from  a class  y of  rational  spline  functions 
having  fixed  knots  and  free  simple  poles.  As  a first  step, 
linear  tangent  spaces  consisting  of  rational  splines  with  fixed 
knots  and  free  double  poles  are  calculated.  The  next  step  is 
the  proof  of  strong  uniqueness  properties  of  best  approximations 
from  y and  from  the  tangent  spaces.  This  implies  quadratic  local 
convergence  of  a Newton-type  algorithm  by  L.  Cromme  for  calcu- 
lating best  approximations  from  y.The  numerical  performance  of 
the  algorithm  is  discussed. 

I Introduction 

Let  1 = |_a,b  J be  a compact  interval  of  the  real  numbers  R, 

divided  into  sub intervals  I.  = x. ,x.  , 1 for  j = l....,n  by 

J L J J+l  J J 7 

fixed  "knots"  a = x < x ,<...<  x ^ .=  b . We  consider  best 

o 1 n+ 1 

approximations  s to  given  functions  f e C(I)  with  respect  to 
the  Chebyshev  (uniform)  norm  on  C(I).  The  approximations  are 
taken  from  the  set 


2 

(1)  y ” f s c C (I)  [ s"  > o,  s = — 2-  on  I.  q.  e P.,  p.  t P.  1 

qj  J.  J I *J  2 1 

of  special  rational  spline  functions  [ 1 , 2 , 5 , 8 ] , where  P^  denotes 
the  space  of  polynomials  not  exceeding  k.  The  class  of  all  ra- 
tional spline  functions  is  exactly  the  class  of  functions  pro- 
cessed by  computers.  Thus  (1)  serves  as  a step  towards  the  in- 
vestigation of  the  best  possible  approximation  of  a function  on 
a computer.  Another  argument  for  considering  rational  splines 
is  the  combination  of  the  flexibility  of  splines  and  rational 
functions  (see  examples  in  [5,8]. 

Preceding  page  blank 
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The  main  result  of  this  paper  (Theorem  2 below)  states 
that  any  locally  strongly  unique  best  approximation  (for  which 
a sufficient  criterion  is  given)  can  be  calculated  by  a locally 
quadratical ly  convergent  iteration  involving  only  linear 
approximation  problems.  The  concept  of  strong  uniqueness  is 
used  here  to  ensure  good  numerical  behavior;  this  is  quite  na- 
tural in  view  of  the  theorem  of  Freud  [3  J,  stating  that  strong 
uniqueness  ensures  Lipschitz  continuous  dependence  of  best 
approximations  from  the  data. 

2 Parametrization  of  y 


On  the  open  set 
A = { a = (y„,M^, . . . ,M 


o o 


n+1 ,yn+l 


) e R 


n+A 


Mo’-'-’Mn+,  > 


° } 


,n+4 


in  R we  can  parametrize  y by  regarding  vectors  a c A as  vec- 

3 3 

tors  of  values  (s (x  ) , /s" (xq) , . . . , /s" (x  j ) ,s(x^+|))  of  a 
spline  s = F(a)  e y.  The  explicit  construction  of  s from  a is 
carried  out  by  evaluating  (3. A)  in  [ 5 _]  , solving  (2)  for 
y I , . . . ,yn>  and  evaluating  (3.7)  in  ["5  I.  For  fixed  a e A we  get 

the  value  F'(b)  = u of  the  Frechet  derivative  of  F at  a with 

3 n+A 

argument  b = (z  ,N  ,...,N  ,,z  .)  c R by  the  formulae 

o o n+ I n+ 1 


2 D. 
J 


(2)  D.  = 


(h.N.  , + h.N.  , )M7  + 2M.N.(h.M.  , + h.M.  ,), 
J J-l  J J+1  J J J J J"'  J J+> 


j h . + h . 

J-l  J 


JLiL 


- y . y . 
J.  _ _J_ 


h. 

J 


hj-l 


^ — “)  (I  < j < n) , 


(which  is  to  be  solved  for  y ^ , . . . , y n » setting  y^  = z^, 


n+l 


zn+l)*  and 


u(t)  - yjtI*(t-x.t|)(-4-3  * {h.Hjt|(H.  + |N..2H.Njt|)) 


+ v. (t) (3h.M.N.  . - (x . -t) (3M .N .-2M.M..N.  -M7  N . ) ) 

J 1 J J J+1  J+l  J J+l  J J+l  J+1  J+1  J ’ 

2 


Vj  (t) 


Mi+l(t-Xi+l) 


2hj(1  - (xj+i-c)  -tn-) 

j j 


h . = x.  , - x . , 
J J + l J 
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for  t E Ij.  J = o,...,n.  Then 

u(x.)  = y.,  u"(x.)  = 3M?N . , j = o,...,n+). 

J J J J J J 

These  formulae  are  the  derivatives  of  those  defining  F.  Since 
for  fixed  a there  are  no  difficulties  concerning  singularities, 
one  can  show  that  they  actually  describe  the  continous  Frechet 
derivative  F^  of  F at  a.  These  parametri zations  define  certain 
interpolating  splines;  nevertheless  they  are  explicit  except 
for  solving  a 3-banded  linear  system  of  equations. 


3 Properties  of  the  tangent  spaces 
We  first  recall  the  concept  of  strong  uniqueness  : 


DEFINITION.  An  approximation  v from  a set  V £f_  elements  of  a> 
normed  space  X is^  £ (locally)  strongly  unique  best  approxima- 
tion to  a given  element  f e X iff  there  is  £ K > o such  that 

||w  ~ f ||  * ||  v ~ f ||+  K|(w  - v || 


holds  for  a 1 1 w t V (for  al  1 w e U n V,  where  U j_s  a neighbor- 
hood of  u,  respectively) . 

• n^4 

For  s = F(a)  in  the  above  sense,  let  T = F' (R  ) denote 

s a 

the  tangent  space  to  y in  s.  We  now  list  some  properties  of  T^ : 


THEOREM  1 . Let  s have  denominators  q^  ui  I ^ , j = o, . . . ,n,  and 

assume  q.  > o in  I..  Then,  for  j = o,...,n  : 

— J 


(a) 

(b) 

(c) 

(d) 


2 

If  r.  c P„,  then  (r./q.)"  has  at  most  one  zero  in  I.  or 

— j 3 — j j — 

vanishes  identical ly . 

2 

T coincides  with  the  space  of  C -functions  on  I having 
S 2 

the  form  r./q.  in  I.,  where  r.  c P0. 

r j j ~ j J 3 

The  Frechet  derivative  F'  defined  above  is  an  isomorphism 
n+4 

between  R and  T . 


Let  g e C(I)  al  ternate  in  n+5  points  tj  < < •••  < tn+5 

of  I and  assume 


1 
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(3) 


t . . < X.  < 

1+1  1 


i+4’ 


1.. 


Then  0 j^s  a strongly  unique  best  approximation  _to  g iji  T . 
(e)  Let  the  error  function  g = f - s for  f e C(I)  and  s 
have  the  properties  assumed  above . 

Then  s is  a unique  best  approximation  to  f ; j_n  addition, 
s i_s  locally  strongly  unique . 


Indication  of  proof  : Statement  (a)  follows  by  direct  cal- 

culation. To  show  (b)  and  (c) , one  has  to  apply  unicity  argu- 
ments (see  Satz  5 in  [5])  for  suitable  interpolating  functions 
provided  by  section  2.  The  proof  of  (d)  proceeds  along  the 
lines  of  [O  see  examples  6 and  7 there),  while  (e)  follows 
from  (d)  by  results  of  Braess  and  Werner  ([*2^  Satz  2.4),  and 
of  Wulbert  ([*9].  Lemma  9). 


Remarks  : (I)  Condition  (3)  is  satisfied  for  the  approxima- 

tion of  generalized  monosplines  (Braess  [l  ])  . 

(2)  Condition  (3)  seems  to  be  inevitable.  This  is  indicated 
by  the  uniqueness  properties  of  approximations  by  cubic 
splines  7 1 , since  these  form  T^  in  case  s is  a parabola. 

(3)  The  spaces  Tg  are  generalizations  of  cubic  splines;  they 
can  easily  be  generalized  and  seem  worth  investigating. 

(4)  The  results  of  this  section  do  not  carry  over  to  the 
closure  y of  y studied  by  Werner  and  Braess  [2,8].  Even  for 
n ■ 1 the  characterization  of  locally  strongly  unique  best 
approximations  is  rather  complicated. 

(5)  Theorem  I remains  valid  if  I is  replaced  by  a discrete  set 
of  points  containing  the  knots  and  at  least  n+5  points. 
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4 Numerical  procedure 

We  apply  the  following  algorithm  of  Cromme  [4  j : 

START  : Choose  a*  e A. 

ITERATION  : Given  a1  c A,  calculate  FCa1)  = e y and  a bes t 

approximation  u . e T Ui  f - s . yielding  a parameter  vector 

,i  0n+4  _ 1 i+1  i ,,  i . 

b e R . Form  a new  parameter  vector  a - - 


a + Ab  with 


a suitable  A e (o,  I J J^o  ensure  a1+ ' e A 
We  then  have  according  to  [4] 


THEOREM  2.  Let  the  hypotheses  of  Theorem  1 , (e)  prevail . Then 
the  above  algorithm  converges  quadratical ly  to  s i_f  Lt  is 
started  sufficiently  near  s.  Furthermore,  one  can  set  A=  1 in 
this  case. 


Some  remarks  to  this  algorithm  seem  appropriate  : 

(1)  The  algorithm  does  not  involve  any  nonlinear  processes;  it 
consists  only  of  evaluations  like  those  in  section  2 and,  of 
course,  a sequence  of  linear  approximation  problems. 

(2)  The  parameter  A is  used  to  enlarge  the  convergence  domain 
(by  using  a small  A.  whenever  A.=  l would  lead  out  of  A)  and  to  allow 
the  creeping  to  the  boundary  of  A. 


(3)  Of  course  one  could  seek  an  optimal  A in  the  way  that 
||f  - F(aX  + 'M)1)||  is  minimized.  On  the  other  hand,  one  could 
split  A into  n+4  components  and  adjust  them  separately  to  main- 
tain a1  e A.  The  performance  of  these  variations  of  the  algo- 
rithm is  yet  to  be  tested. 


5 Numerical  examples 

Of  course  there  are  many  examples  showing  the  features 
of  the  algorithm  stated  above.  As  a typical  case,  the  approxi- 
mation of  cosh(x)  on  [-1,  + 1 ] always  converged  quadratically 
and  it  never  took  more  than  5 iterations  to  give  a stable 
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13-digit  precision.  Several  placements  of  up  to  5 inner  knots 

were  tested,  and  the  simple  one-sided  starting  approximation 
2 

1  + o.5x  was  used.  The  correct  alternation  behavior  came  up 
after  one  or  two  iterations. 

So  it  appears  to  be  better  to  comment  on  the  shortcomings 
of  the  algorithm  : 

(1)  There  may  be  only  local  convergence.  But  for  examples 
satisfying  the  conditions  of  Theorem  2 the  algorithm  seems  to 
be  globally  convergent,  though  it  often  creeps  along  due  to 
small  values  of  A when  started  far  away  from  the  best  approxi- 
mation . 

(2)  There  may  be  no  best  approximation  to  f in  y.  Then  the 
algorithm  moves  very  slowly  to  the  boundary  of  y ; the  conver- 
gence may  even  be  poor  when  one  is  far  away  from  the  boundary. 
This  is  due  to  the  fact  that  the  approximations  from  Tg  ignore 
the  convexity  needed  for  approximations  from  y. 

(3)  There  may  be  cases  where  f has  a best  approximation  s in  y 
but  (3)  does  not  hold.  We  did  not  try  specially  constructed 
functions;  for  the  commonplace  functions  we  tested,  either  (3) 
came  out  to  be  satisfied  or  the  situation  (2)  occurred. 
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MAXIMUM  NORM  ERROR  ESTIMATES  IN  THE  FINITE  ELEMENT  METHOD 
FOR  POISSON  EQUATION  ON  PLANE  DOMAINS  WITH  CORNERS 

A.  H.  Schatz  and  L.  B.  Wahlbin 

Estimates  are  given  for  the  error  in  the  L -norm  of  finite 

00 

element  approximations  of  domains  with  corners. 


In  this  lecture  we  shall  discuss  some  particular  cases  of 
results  which  were  recently  obtained  by  the  authors.  Since 
the  precise  statements  of  these  results  are  somewhat  techni- 
cal, we  shall  restrict  ourselves  here  to  a sketch.  A more  ge- 
neral and  detailed  account  with  proofs  will  be  given  elsewhere. 

Let  H be  a bounded  simply  connected  domain  in  the  plane 
whose  boundary  9ft  consists  of  a finite  number  of  straight  line 
segments  with  interior  angles  0 < _<  •••  _<  < 2tt,  i.e. 

ft  is  a polygonal  domain  which  may  have  a number  of  slits.  Con- 
sider Dirichlet’s  problem 

(1)  - Au  = f on  ft 

(2)  u = 0 on  9ft  . 

For  simplicity,  let  f be  smooth  on  ft.  In  this  case  it  is  well 
known  that  u is  smooth  away  from  the  corners  but  in  general 
may  be  badly  behaved  near  the  corner  points.  Let  us  suppose 
that  the  vertex  of  the  corner  is  located  at  the  origin 

(see  Figure  1). 

Then  for  R sufficiently  small  (where  (R,0)  are  polar  co- 
ordinates) the  "singular  part"  of  u behaves  like  (c.f.  [9]) 

Preceding  page  blank 
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TT 

a . 

(3)  C.R  J sin(— 6)  (£nR)q, 

where  q = 0 if  a.  ^ tt/m,  m > 2 an  integer,  and  q = 1 otherwise. 


Figure  1 

We  wish  to  approximate  u using  the  finite  element  method. 

To  this  end  let  0 ^ h 1 be  a parameter  and  let  S^(fi)  denote 

1 °1 

a family  of  finite  dimensional  subspaces  of  W ^(fi)  H W^Cfi). 

Here  W1"^)  denotes  the  usual  Sobolev  space  and  w (ft)  is  the 
P m P 

closure  of  C ( ) in  the-  norm,  of  W (.,).  The  approximate  solu- 
h°  p 

tion  U,  £ S i-  t : * i-  the  unique  solution  of  the 
h 

Ritz  Galerkin  Kc 

f t dx,  for  all  <p  £ S , 
or 

(5)  D(u  - ii  , » ) » 0 forall$£S^. 

n 

We  will  investigate  maximum  norm  estimates  for  the  error 

u(x)  - u,  (x)  as  h tends  to  zero,  where  S^*  is  chosen  to  have 
h 

properties  which  are  shared  by  many  subspaces  used  in  practice. 
Rather  than  detail  the  abstracted  properties  we  need,  we  con- 
fine ourselves  here  to  listing  two  examples  of  classes  of  sub- 
spaces which  satisfy  our  hypothesis.  In  what  follows  r > 2 is 
a given  integer. 
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Example  1.  For  each  0 < h < 1 let  be  a quasi-uniform 
triangulation  of  ft  with  h the  maximum  length  of  any  side  of 
any  triangle  t £ T^.  We  define  S*1  as  the  subspace  of  C^(ft) 
consisting  of  those  functions  which  vanish  on  3ft  and  which, 
when  restricted  to  each  triangle  x,  are  polynomials  of  degree 
r - 1. 

Example  2.  Suppose  the  boundary  segments  of  the  domain 
are  parallel  to  the  co-ordinate  axis.  Then  it  is  often  pos- 
sible to  define  S^(ft)  in  the  following  way: 

(i)  S^(ft)  is  a subspace  of  the  restriction  to  ft  of  ten- 
sor products  of  piecewise  polynomials. 

(ii)  The  functions  in  S^(ft)  vanish  on  3ft. 

Roughly  speaking,  r is  the  optimal  order  of  h which  a 
r °1 

function  in  W (ft)  f)  W„(ft)  can  be  approximated  in  the  maximum 

^ h 

norm  by  a function  in  S . With  regard  to  very  recent  work  on 
maximum  norm  estimates  we  refer  the  reader  to  Douglas,  Dupont, 
and  Wahlbin  [3],  Natterer  [5],  Nitsche  [6],  Scott  [8]  and 
Schatz  and  Wahlbin  [7],  where  other  relevant  references  may  be 
found.  Concerning  the  finite  element  method  for  domains  with 
corners  see  Babuska  and  Rosenzweig  [2]  and  Strang  [9]  for  es- 
timates in  based  norms. 

Since  (even  for  smooth  f)  the  smoothness  of  the  solution 

u may  differ  near  each  corner  and  again  in  the  interior  of  ft, 

we  wish  to  investigate  the  error  u(x)  - u,  (x)  locally.  To  this 

h 

end,  let  ft^ , j = 1,«*»,M,  be  any  neighborhood  (in  ft)  of  the 
j t^1  corner  such  that  ft^  does  not  contain  any  other  vertex  of 
ft  and  let  ft^  be  any  subset  of  ft  such  that  ft^  does  not  contain 
any  vertex  of  ft  (i.e.  avoids  the  corners).  We  then  have  the 
following  error  estimates: 

THEOREM  1 . Let  e > 0 be  arbitrary  but  fixed.  There 
exists  a constant  C such  that  for  h sufficiently  small  the 
following  hold: 
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i 

► 


(6)  | u - u 


h1  1 °° 

L (fi  ) 


min(-rr/a.  ,r,2ir/a  )-e 

< Ch  J M j=l»  * * * »M, 


and 

(7) 


u - u. 


< Ch 


min(r  ,27r/aw)-e 
M 


l (fiQ) 


To  illustrate  these  estimates,  let  fi  be  the  L shaped  re- 


= 3/2tt.  Then  for  any  of  the  subspaces  S*1  given  in 
examples  1 and  2 we  have  that  away  from  the  re-entrant  corner, 

(8)  I I u - u | | < Ch4/3_£  j=0,l, • • • ,6. 

L (fij  ) 

In  any  neighborhood  of  the  re-entrant  corner 

(9)  I I u - u | | <Ch2/3'e. 

L (fi6) 

In  this  case  the  estimates  (8)  and  (9)  agree  with  those  found 
in  computer  experiments  shown  to  us  by  Babuska  [1]  (cf.  also 
[9]).  Let  us  note  that  the  estimates  (8)  and  (9)  take  into 
account  the  worst  possible  behavior  of  u near  the  corners.  In 
the  case  of  the  L-shaped  region  Douglas,  Dupont,  and  Wheeler 
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[4]  have  shown  that  if  the  solution  u is  sufficiently  smooth 
everywhere  on  ft  and  if  for  S*1  one  takes  the  tensor  products  of 
continuous  one-dimensional  piecewise  polynomials  (a  special 
case  of  example  2),  then  in  maximum  norm  the  rate  of  conver- 

Y — 

gence  is  order  h everywhere  on  ft.  We  can  show  that  if  ft  is 
a convex  polygon  and  S^1  is  taken,  for  example,  to  be  the  sub- 
spaces given  in  examples  1 and  2 then  the  convergence  is  of 
order  h (for  sufficiently  smooth  solutions). 

Returning  to  the  general  case,  let  us  now  assume  that  the 

coefficient  C in  the  "singular  part"  of  u,  (3)  (associated 
M 

with  the  maximum  interior  angle  a,.)  is  different  from  zero. 

M 

To  simplify  the  discussion  here,  let  us  assume  tt  < < 2tt  and 

a , < a . The  estimates  (6)  and  (7)  then  predict  a lower 
M-i  M th 

rate  of  convergence  near  the  M corner  than  elsewhere.  It  is 
easy  to  see  that 


-+  e 


> ch 


L (ft) 


where  e > 0 is  arbitrary  and  c = c(e)  > 0.  This  suggests  that 
perhaps  the  maximum  pointwise  error  must  occur  near  the  vertex 
of  the  maximum  interior  angle.  Under  the  above  assumptions  we 
can  prove  the  following: 

THEOREM  2.  Let  6 > 0 be  arbitrary  but  fixed.  For  each  h 

let  x £ ft  be  such  that  |u(x  ) - u (x  ) | = | | u — u | | 

" L°°(ft) 

Then  for  h sufficiently  small 

l I u1"5 

Uh-*MI  ih  . 

where  x^  is  the  position  of  the  vertex  of  the  maximum  interior 
angle. 

Finally,  let  us  briefly  mention  some  other  results. 

1)  Calculation  of  the  coefficient  Cw  in  (3).  Let  us  assume 

M 


545 


A.  H.  SCHATZ  AND  L.  B.  WAHLBIN 


that  tt  < < 2tt.  The  calculation  of  a good  approximation 

to  of  importance  in  some  physical  problems.  For  fixed 

0 < 0 < aw,  one  can  define 
M 


where  d 
small , 


= 8) 

M tt/u 


^ sin (—  0 ) 
aM 


2 n/  (a  -Hi ) 

h . It  can  be  shown  that  for  h sufficiently 


211  ( 
I 


- u/(«M  + n)) 


2)  Refinements . In  order  to  improve  the  rate  of  convergence 
one  may  take  a finer  mesh  near  the  corners.  We  have  partial 
results  in  this  direction. 

3)  Singular  functions.  Another  way  to  increase  the  accuracy 
of  the  approximate  solution  is  to  add  to  S*'  certain  functions 
which  mimic  the  behavior  of  the  solution  near  the  vertices. 

We  also  have  partial  results  for  this  method. 
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Karl  Scherer 

Some  inverse  theorems  for  best  L -approximation  by  A- 
splines  are  presented,  including  thosi  of  a recent  paper  of 
Johnen/Scherer  for  sequences  of  (strongly)  mixed  and  nested  par- 
titions of  knots,  as  well  as  the  corresponding  one  for  the  se- 
quence of  equidistant  partitions  and  a weak  saturation  theorem 
for  general  sequences  of  partitions. 

1 Introduction 

The  purpose  of  this  note  is  to  complete  the  discussion  in 
[4]  where  the  following  inverse  theorem  was  proved: 

r a i 00 

THEOREM  1 . Let  b£  £i  strongly  mixed  sequence  of  par- 

titions of  [a,b] . Then  there  exists  £ constant  c>o  such 

that  for  all  f e L (a,b)  and  all  kik 
p ’ o 

(1)  K(A,  ,f ;p,A)^c  sup  E^  (f;p)  (l^p$°°) 

l»k  A1 

In  case  p = 00  the  assumption  "strongly  mixed"  can  be 
weakened  to  "mixed".  Here  the  best  approximation 

(2)  E^A)(f  ;p)  = inf  ||  f-s|| 

seSp(A,A)  p 

is  introduced,  where  Sp(A,A)  denotes  the  class  of  A-spline- 
functions  s which  belong  to  the  nullspace  N^  of  the  linear 
differential  operator 
n- 1 

(3)  A = Dn  + l a.DJ  (a.  e CJ [a,b]) 

j=o  ^ ^ 

on  each  subsegment  1^  = (x.,x^+|)  (i*o, . . . ,N- 1 ) of  a given 
partition 

(4)  A : a = xo<Xj<. . .<xN1<xN  = b 

with  A ■ max  (x.-x.  A = min  (x.-x.  It  should  be 

HUN  1 1‘l  - UUN  1 1-1 

Preceding  page  blank 
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remarked  that  smoothness  of  the  A-splines  is  not  required 
since  this  would  give  no  better  results  in  (1). 

The  second  quantity  in  (1)  is  defined  for  each  f e Lp(a,t) 
and  o<t<°°  by 

(5)  K(t,f;p,A)  = inf  { | j f-g  1 1 + tn||  Ag||  : g e w"(a,b)}  , 

n * r r | 

where  Wp(a,b)  denotes  the  usual  Sobolev  space  of  order  n. 

In  view  of  the  fact  that  (cf.  [4J) 

(6)  lim  inf  t nK(t,f;p,A)  = 0 ~=  > f e N^, 
t+o 

inequality  (1)  establishes  in  particular  that  such  a sequence 

r (A)  , °°  # # — n — — 

{E^  (f ;p))1  = j is  saturated  with  order  A^  (if  A^A^+j), 

that  is  that  E^A^(f;p)  = 0 ( A ^ n)  for  l-*»  implies  f e 

or  E^A) (f  ;p)  = 0 for  all  1 . 

1 

The  condition  "mixed"  imposed  on  the  sequence 

requires  that  there  is  a number  k such  that  for  k*k  there 

o o 

holds 

(7)  sup  min  J.(k,l)^dA, 

Uk  i 1 

with  a constant  independent  of  i,k,l  where  f. (k,l)  = 
dist(t^.A^)  and  t.  is  the  i-th  knot  of  A^. 

Strongly  mixed  means  that  in  addition  the  sup  in  (7)  can 
be  extended  over  only  a finite  number  (fixed  with  respect  to  k) 
of  l's. 

It  is  easy  to  construct  simple  examples  of  strongly  mixed 
sequences  of  partitions.  However,  it  could  not  be  shown  in  T4] 
that  the  sequence 

(8)  A*  = {tt  N ■ i/N,  o<i<N) 

of  equidistant  partitions  is  strongly  mixed,  so  Theorem  1 ap- 
plies to  this  case  only  for  p=co.  In  this  paper  it  will  be 
shown  that  (the  unit  interval  (0,1)  is  considered) 
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(9) 


K(l/N,f;p,A)  *< 


sup  E.*  (f;p). 
N^Mr2N  >1 


A still  sharper  result,  involving  on  the  right  side  only 
M = N,N+I,  can  be  proved  for  polynomial  splines  or,  more 
generally,  for  A with  constant  coefficients  a.  (see  [5]). 


2 Proof  of  the  results 


One  begins  with  the  inequality 
(10)  K ( t , f ;p , A)  £ ||  f-sN||p  + K(t,sN;p,A) 

where  is  an  element  of  S^(A*,A)  of  best  approximation 

to  f tL  (0,1)  (for  convenience  the  case  (a,b)  = (0,1)  is 

considered).  By  a lemma  in  [A]  one  can  further  estimate 
N 


K(t,sN;p,A)  C Cf  l (1+t  £ P)e 


i=  1 


max 
ock<  n-1 


1/p 


with  a constant  c depending  only  on  A and  p.  The  [sl^ 
are  the  jumps  of  the  k-th  derivative  of  s at  the  mesh  points 
x^  and  c is  a positive  number  < A£ 1 . Thus,  taking  £ = t 
= 1 /N,  one  has 

K(l/N,s  ;p,A)  * C{  l max  N~(kp+1 ) | [s  ]^|P} ' /p. 

i=  1 o<k£n+ 1 1 

By  a Markov-type  inequality  for  elements  of  N(A)  on 

arbitrary  subintervals  of  (0,1)  one  obtains  for  N<M£2N 

’ 1 k 
5N  l 


tsj-  ■ I I®N  - s„l-l  * c^(N,M)_k_l/p||  SN-SM||  p i , 


denotes  the  norm 


where  #.(N,M)  = d ist  ( tt  „ ft  ) and  * 
with  respect  to  the  subinterval  (t?  ^ - y. (N,M),  t?  (N,M)) 

Thus  it  follows 


(II)  K(l/N,sN;p,A)  .< 
N 


C max 

o^kCn-1  i= 1 


{ l N"(kp+I)  ^.(N,M)'(kp+1) 


SN  ~ SM^  p , i ^ 


1 /p 


Now  one  splits  up  the  sum  into  three  parts, 

N CN/3  4 2N/3  N 

02)  l - l * l * l = + 

i = 1 i= I i^N/3  i>2N/3 
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1 


say.  To  handle  the  first  sum  one  uses  a mixing  property,  ob- 
served by  Butler/Richards  [2]  , 


(13)  N~ 

(Bp+l) 

£ C 

i 2N 

min  V 1 

o<  t*  <6  M=N+ 1 

J.N 

for  all 

N^N 

o 

and 

3>o,o<6<1/2,  the  constant  C 

depending 

only  on 

6 and 

6. 

Inserting  this 

with  6 = k+ 1 

gives 

x<N/3  2N 

(14)  Z £ C l (1/N)  l NP$.  (N,M)P  ||  sM  - s 

i=  1 M=N+ 1 1 N 


2N 

* (C/N)  l 

M=N+  1 


N 


M 


P < c max  ||  s 
H N£M£2N 


Inequality  (13)  is  also  valid  for  all  tt  N e (1-6,1)  by 

J » N 

symmetry  so  that  the  same  estimate  as  (14)  also  holds  for  Z^. 


Concerning  the  middle  term  Z.^  one  proceeds  somewhat 
different.  In  (11)  one  chooses  M = N + 1 and  verifies  readily 
that  f^(N,N+l)  > 1/3(N+1),  so  that  one  obtains,  replacing  the 
sum  in  (15)  by  the  sum  over  N/3^i£2N/3, 


£2N/3 


(15>  L,*C(  I lfSN  - SM||P  .)'/PSC||SN-SNtl|| 

iSN/3  p,i  v 

Combining  (10)— (12)  and  (14),  (15)  gives  the  desired  inequality 


(9). 


Together  with  a direct  theorem  on  best  approximat ion  by 
A splines  (cf.  [ 4]  ) it  is  now  possible  to  state  the  following 
approximation  theorem 

THEOREM  2.  Let  §(t)  be^  _a  non-decreasing  function  on  (0,1) 

with  lim«^(t)  = 0.  Then  for  f e L (0, 1 ) , Up£°°,  there  holds 
t+o  P 

E<A)(f;P)  = 0( $( 1 /N) ] ,N x»,  if  and  only  if  K(t,f;p,A)  * 

“ 0[$(t)J  , t-K). 
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In  particular,  a saturation  theorem  holds  for  3(0  = 
0(tn),t~K),  in  view  of  (6).  For  the  other  nontrivial  choices 
of  J(t),  a characterization  of  the  above  assertions  in  terms 
of  Lipschitz  spaces  can  be  obtained  (cf . [4j). 


For  general  sequences  of  partitions  a unified  treatment 
of  inverse  theorems  is  not  possible.  In  particular  the  case 
of  sequences  of  nested  partitions  shows  quite  different 
features.  However,  if  one  assumes  sufficient  smoothness  of  the 
function  to  be  approximated,  a saturation  theorem  can  be  prov- 
ed (the  special  case  of  nested  partitions  has  been  considered 
in  [4]  . 


THEOREM  3.  Let  f e Wn(a,b)  , l<bU°°,  and 

P (A'l 

partitions  of  fa,b] . Then  E,  '(f;p)  = 

AN 


a sequence 
implies 


f e N(A) . 


Proof.  According  to  (10)  and  lemma  in  [4]  one  has 


(16)  K(t,f  ;p,A)£E^  (f  ;p)+C{  £ (l+tnenp)e  max  | ek [s]^ | p} 1 

N i= 1 o^k^n-1 


for  e<A,  where  s is  an  element  of  best  approximation  to  f. 

Ic 

In  order  to  estimate  the  jumps  at  the  point  x^  we  need 

the  fact  that  for  each  subinterval  (e,d)  of  (a,b)  there 
holds  the  Taylor  formula 

b 

(17)  f(x)  = ue(x)  + / v(x,£)Af (£)d£ 

e 

with  u^  e N(A)  satisfying  Ug^  (e):  = f (e)  ,o<  j^n-1  and  the 

n 

kernel  satisfying  v(x,£)  = £ u (£)u  (x)  for  as^x*b  and 

k=l  K K 

v(x,£)  “ 0 for  a<x££*b.  By  the  Taylor  expansion  of  v(x,£) 
(cf . [3])  it  follows  for  x e (e,d) 

Dx^(X,e)  = (x_e)n  1 / (n-j-1 ) ! + (J((x-e)n~j) . 
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Hence  one  obtains  readily,  using  Holder's  inequality,  that 

(18)  [J  |f(x)-ue(x)|pdx}1/p.<c|d-e|n{e/  |AfCO;  Pdr} 1 /p 

with  a constant  not  depending  on  f or  e,d.  Now,  denoting 
by  u^(x)  the  element  of  N(A)  in  the  sense  of  (17)  at 
e = x.-c,  one  obtains  by  a Bernstein-Type  inequality  and  (17) 


sgp  |s<k>(x)-U<k)<*>|)p 

i 'i 


4C  s-u . 


i "P,i 


-<c{  II  — 'Hp.i*  «"ll«1lp.^p 


where  • . denotes  the  L -norm  with  respect  to 

11  MP,i  P 

(*£-£  ,X.+e)  . 

Inserting  this  into  (16)  it  follows  for  any  li'q^p^* 

K(t,f;q,AKC{l  + tne_n}E^>(f;q)+Ctn{  £ ((  Af  (( q . } 1 /q 

N i=l  q>1 

CC{  l+tn£_n}E(A)  (f;p)  + Ct\lArl/p||  Af  ||p  . 

N 

Now  let  E.^‘^(f;p)  = a!1  • tp (N)  with  some  positive  function, 

N 

i(j(N)-K)  for  N-*®,  Then,  for  N sufficiently  large,  take 
£ = ijj(N)*^nA^  so  that 

A"NnK(AN  ,f;p,A)  * C(i|/(N)  + i^(N)  1 /2}  + (^(N)  1 /2nA  N ) 1 /q_  1 /p . 

But  by  a lemma  in  [A]  this  implies  ftN(A). 

Theorem  3 generalizes  a corresponding  result  in  [l,p,I74] 
for  polynomial  splines  and  p = °°.  It  shows  that  functions 
which  are  to  be  approximated  by  A-splines  with  arbitrary 
high  order  of  convergence  cannot  be  found  within  the  class 


4 


4 
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Wp(a,b)  of  smooth  functions.  On  the  other  hand,  Theorem  3 
in  [4]  implies  - at  least  for  sequences  of  nested  partitions  - 
that  in  case  of  smooth  splines  in  W^(a,b) , l£m£n-l  such  func- 
tions of  high  order  of  approximation  must  lie  in  the  class 
W^(a,b) . 


In  summary.  Theorems  1-3  show  that,  concerning  the  order 
of  best  approximation,  the  situation  for  polynomial  splines 
and  the  more  general  A-splines  is  the  same.  This  fact  is 
already  heuristically  explained  by  the  fact  that 
A = an°n  + (•••)» an>°»  a "perturbation"  of  Dn. 
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A GENERALIZATION  OF  A RESULT  OF  SUBBOTIN 


A.  Sharma  and  J.  Tzimbalario 

The  object  of  this  note  is  to  generalize  a result  of 
Subbotin  and  find  the  relation  between  some  linear  differen- 
tial operators  and  the  corresponding  generalized  differences. 

1 Introduction 

oo 

Let  n be  a fixed  positive  integer  and  let  {y,  } be 

k -» 

a given  sequence  of  real  numbers  having  bounded  n-th  order 
differences.  About  ten  years  ago,  Subbotin  [4]  proposed  and 
solved  the  problem  of  finding  a function  f (x)  on  the  whole 
real  line  which  interpolates  the  data  {y  } at  the  integers 
and  has  a least  n-th  derivative  in  the  sup-norm. 

Our  aim  is  to  study  a corresponding  problem  when  the 
n-th  derivative  is  replaced  by  certain  linear  differential 
operator  /.^(D)  and  the  n-th  order  differences  by  suitable 
divided  differences.  Following  a remark  of  Schoenberg  [2] 

"for  an  interesting  and  viable  theory,  we  require  also  trans- 
lation invariance"  and  so  we  restrict  ourselves  to  differen- 
tial operators  with  constant  coefficients  such  that  its 
characteristic  equation  has  only  real  roots.  In  this  case 
the  extremal  function  turns  out  to  be  a cardinal  L-spline 
which  has  been  studied  recently  by  Micchelli  [1]  and 
Schoenberg  [2], [3]. 

In  §2,  we  introduce  some  notations  and  formulate  the 
problem.  We  also  give  some  properties  of  the  zeros  of  certain 
polynomials  which  are  needed  in  the  proof  of  our  main  result. 
We  omit  the  details  of  the  proof. 

Preceding  page  blank 
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2 Preliminaries 

Let  0 = Y»,Y, be  real  constants  and  let  L (D) 

^ 0 1 n n 

and  L , , (D)  be  given  by 
n+I 

(2.1)  Ln(D)  - H (D-y  ),  t;+1(D)  - D Ln(D>,  D 

1 

°o 

For  a given  sequence  {y^l  set 

n y*?  , -I 

(2.2)  A"  y = n (E-e  3)y  , A^y  = (E-l)A^  y 

L m ^ m m tm 

where  Ey  = y If  a real  function  f(x)  defined  on  the 

m -^m+l 

real  line  satisfies  the  condition  sup  1 (D ) f ! < M,  then  it 

1 n 1 — 

can  be  easily  seen  that  sup^jA^  f (m) | is  bounded.  We  shall 
be  concerned  in  the  sequel  with  the  following  problem: 


PROBLEM. 

dition 


the  con- 


(2.3)  sup  | A?  y | < 1, 

, , L m — 

m e Z 

find  a_  function  f (x)  satisfying  the  following  conditions : 

(2.4)  f(k)  = yk,  k e Z 

(2.5)  II  L (D ) f 1 1 = sup  |L  (D)f(x)|  is  minimum 

n °°  n 

x 

when  f runs  over  the  functions  satisfying  (2.4). 

For  the  solution  of  this  problem  we  shall  need  some 

auxiliary  functions  and  their  properties.  Let  (x)  be  the 

n 

unique  solution  of  ln(D)y  = 0 satisfying  the  conditions 

(2.6)  <t>n(v)(0)  = 6 v = 0,l,*“,n-l. 

n v ,n-l 

Set 

(2.7)  (x)  = f <(>  (t)dt. 

n Jo  n 
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Then  i^n  (x)  is  the  unique  solution  of  Ln+^(D)y  = 0 

determined  by  the  conditions  0^V^(O)  =6  (v  = 0,1, •••tn). 

n vn 

n YiX 

Since  A^  e J =0,  j = l,***,n,  it  follows  easily  that 

,00 


(2.8) 

where 


.n 


A^  f(x)  = 


J0 


A?  (<()  (x-t)  ) L (D)f(t)dt 
L v n +'  n 


A^  applies  to  the  variable  x and 


(2.9)  <b  (x-t),  = <j>  (x-t),  for  t < x,  and  = 0 otherwise. 

n + n — 

* 

We  now  introduce  the  function  n (x;a)  where 

n 

00 

k n 0 n+1 

(2.10)  II  (x;a)  = J x A ^(£-n-a)  . 

n o i * + 

£■=-“  l i 

* * 

As  a function  of  a,  II  (x;a)  is  a cardinal  L -spline  and  as 

n 

a function  of  x,  it  is  a polynomial  of  degree  n when 
0 < a < 1 and  is  of  degree  n-1  when  a = 0.  It  is  clear 
that 

(2.11)  n*(x;a+l)  = x n*(x;a). 

n n 

■k 

In  the  language  of  Schoenberg,  IT  (x;a)  is  the  "exponential 
* n 
L -spline  of  basis  x . Following  a result  of  Micchelli  ([1] 

Theorem  2.3  and  Coro.  2.4),  it  turns  out  that  for  a given  a, 

k 

0 < a < 1,  all  the  zeros  of  II  (x;a)  are  simple  and  negative. 

— n 

(For  a different  proof  of  this  we  refer  to  Schoenberg  [2]). 

Micchelli  showed  that 


(2.12)  A* (l-a;  -)  = (-1)°  Xx  A*(a;x). 
n x n 


* * " \ 

Here  II  (x;a)  = A (a;x)  II  (e  -x) . From  this  it  is  easy  to 
n n 

verify  that  ° 


(2.13) 


x TI  x;  — , n even 
n v 2' 

xn  1 n (x;  0),  n odd. 
n 
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It  follows  from  this  that  for  n even  JI  (— 1 ; 0 ) = 0 and  for 

* , In  * n 

n odd  II  (-1;  — ) = 0.  Since  Jl^C-ljot)  has  only  one  simple 
zero  in  [0,1)  ([1],[2]),  it  follows  that 

n (-15  x)  ^ 0,  n even 

(2.14)  n Z 

II  (—  1 ; 0 ) ^ 0,  n odd. 
n 

We  shall  also  require  later  the  exponential  polynomial  P (v) 
given  by 

00 

(2.15)  Pn(v)  = l (-l)3  a"  <fr  (-j+l-v)+  . 

j=-°° 

Comparing  (2.15)  with  (2.10),  we  see  that  the  polynomial 

P (v)  corresponds  to  the  operator  L and  that  P (v)  = 
n n 

IT  (-l;v)  in  analogy  with  the  notation  of  (2.10).  It  follows 
n 

mutatis  mutandis  that 

(2.16)  P (1-v)  = (-l)n+1P  (v). 

n n 

Also  the  only  zero  of  P (v)  in  [0,1)  is  a simple  zero  at 
1 n 

— when  n is  even  and  is  a simple  zero  at  0 when  n is 
odd . 

When  X = X^  = •••  « X = 0,  the  exponential  polynomial 
P^(v)  reduces  to  the  Euler  polynomial  except  for  a constant 
factor.  It  follows  from  a continuity  argument  and  from  a 
property  of  the  Euler  polynomials  that 

[—1 

(2.17)  |Pn(v)|  = (-1)  2 Pn(v),  | < v < 1. 

3 Main  Result 

We  shall  give  a complete  answer  to  the  Problem  when  the 

operator  L^(D)  is  formally  self-adjoint,  that  is, 

L (-D)  = (-l)n  L (D).  In  other  words  the  sets  (y, , • • • ,y  } 
n n In 

and  (-y^, * * • »_yn)  are  equal.  We  then  have  the  following 
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THEOREM  1.  Let  the  numbers  y,»***»y  be  such  that  the 
— 1 n 

operator  (.^(D)  i_s  formally  self-adjoint . Then  for  each 

oo 

sequence  {y,  } satisfying  (2.4),  there  is  a function  f (x) 

K — oo  " — 

satisfying  (2.5)  such  that 

sup  J L (D)f (x) | < A . . 

1 n ' — n,L 


Here 


A 

n. 


L 


c l 

• o 


JL 

P (v) Idv 
n ' 


(-1) 


[-] 
1 2 J 


n (-i;a) 

n 


where  a = — when  n is  even,  and  a = 0 when  n is  odd. 

^ * 

Here  P (v)  and  n (-l,a)  are  given  by  (2.15)  and  (2.10) 
n n — 

respectively.  The  constant  A^  ^ ijs  sharp . 

In  the  special  case  when  y = y = • • • = y = 0,  the 

12  n 

result  was  proved  by  Subbotin  [4].  In  order  to  prove  this  we 
first  show  that 


|P  (v) Idv 

' n ' 

tfi 

(-D  z 

and  then  prove  that  A — . Finally  we  check  that 

n’  nn(-i;a) 

the  two  bounds  are  equal.  The  details  will  appear  elsewhere. 


4 Conclusion 


In  the  above  the  lower  estimate  for  A . remains  valid 

n,  L 

even  when  the  operator  (.^(D)  is  not  formally  self-adjoint. 

The  first  non-trivial  case  occurs  when  n = 2.  In  this 

2 2 

case  using  (2.10),  it  follows  easily  that  if  (.(D)  = D - y , 


Y 2 Y 

then  8 cosh  — sinh  , A„  , 
2 4 2 , L 


Y . The  exponential  polynomial 
Pn(v)  can  be  looked  upon  as  a generalization  of  the  Euler 
polynomials  to  which  it  reduces  when  y^  = ^ = ••• 
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i 

i 

Yn  = 0 except  for  a constant  factor.  It  would  be  interesting 
to  know  if  other  properties  of  Euler  polynomials  can  be 
carried  over  to  P (v). 

n 

It  is  possible  to  propose  a more  general  problem  by 

replacing  (2.4)  and  (2.6)  by  inequalities  in  SL  and  L 

P P 

norm  respectively.  It  appears  that  the  solution  to  the  more 
general  problem  can  be  obtained  on  the  lines  of  Subbotin 
([5], [6]),  in  the  light  of  the  above  result. 
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ON  THE  SMOOTHNESS  OF  LOCAL  BEST  L SPLINE  APPROXIMATIONS 

P 

Philip  W.  Smith 

This  paper  is  concerned  with  best  and  local  best  L 

P 

approximation  by  splines  with  variable  knots.  In  particular 
it  will  be  shown  that  local  best  L^  , 1 < p < <*>  , spline 

approximation  to  a continuous  function  always  yields  a 

continuous  function.  Such  is  not  the  case  in  L . Con- 

00 

ditions  are  also  obtained  on  the  nature  of  the  possible  dis- 
continuities when  approximating  certain  discontinuous 
functions.  These  results  extend  and  amplify  those  in  [3,  4] 
and  [ 2 ] . 


1 Introduction 


The  notation  we  employ  here  will  be  easily  recognized  by 

those  familiar  with  spline  theory.  Let  a positive  integer 

k be  given  and  let  t denote  a knot  vector  satisfying 

t : 0 = t„  = ...  = t.  < t. t < t ,,=...=  t 

k k+1  - 


o = t1  = 


= 1 with 


Vk  > Cj 


for  j = 1, 


< t < t - ...  ^ 

- n n+1  n+k 

, n.  The  normalized 


B-spline  N^  corresponding  to  the  knot  vector  t is 

defined  by  »1>k<t,T>  - [t  ^ , 1 1+k) 

1 < i < n (cf.  [1]).  We  denote  the  set  of  splines  of 
order  k with  n - k or  fewer  knots  (counting  multiplici- 
ties) in  (0,1)  by  . It  is  well  known  [1]  that  s e 

n n 

n 


if  and  only  if  s(’)  = £ AN  , (t,*) 

i=l  1 1»k  ’ 

vector  t and  some  coefficients  A. 


for  some  knot 


Given  a function  f e L [0,1] 

P 

a best  L [0,1]  approximation  to  f 


s e S 


inf { | | y 


-4 1, 


k 

y e S } 

n 


a local  best  L^[0,1]  approximation  to  f 


we  will  call 

if  I f s-f | I 

Similarly  we  will  call  s e S 
if 


n 

for  some 
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e > 0,  | | f — s | | = inf { | | f — y | | : y e and  | I y— s I I < e} 

ii  iip  ii  '''p  n 11  ''p 

2 Main  results 

n 

Let  f c L [0,1]  and  s(t,*)  = J A.  N.  , (t,-) 

P * 1 J,kv=’ 

We  will  denote  the  error  between  f and  s by 

e(t,t)  = f ( t)  - s(t,r).  For  any  function  g with 

left  and  right  limits  at  a we  set  [g] (a)  = g(a+)  - g(a-) 

The  first  theorem  relates  discontinuities  in  a local  best 

approximating  spline  tD  discontinuities  in  the  error. 


THEOREM  2.1.  Let  s(t,-)  e S 


approximation  to  f e L^ [ 0 , 1 ] 


be  a local  best 

1 < p < oo 


_ Lp[0,l] 

Assume  further 


that  for  some  a e (0,1) 
exist.  Then 


1 im  f ( t ) and  1 im  f ( t ) 

t-*a+  t->a~ 


(2.1)  i)  e (t , a-)  [s(t,-)l  (a)  < 0 . 
ii)  e(t,a+)  [ s (t , * ) 1 (a)  > 0 . 

Note  that  (2.1)  is  trivial  unless  a is  a point  of 

discontinuity  of  s.  Thus  we  may  assume  that  t < a = 

m-1 

t = . . . = t , < t where  the  {t.}  are  the 
m m+k-1  m+k  l 

components  of  the  knot  vector  t for  s.  The  idea  of  the 

proof  is  to  differentiate  the  error  functional, 

| [ e ( t , * ) | | P , with  respect  to  t and  t ,.  . . Since  it 
- p m m+K-l 

is  only  possible  to  compute  one-sided  derivatives  of  this 

functional  with  respect  to  t and  t , , we  obtain 

m m+k-1 

the  inequalities  in  (2.1).  These  results  for  1 < p < °° 
under  only  slightly  more  restrictive  hypotheses  were 
obatined  in  [2],  The  case  p = 1 is  new. 

It  is  now  possible  with  the  aid  of  Theorem  2.1  to 
obtain  results  concerning  the  continuity  of  local  best 
spline  approximations. 
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THEOREM  2.2.  I_f  1 < p < °°  and  f is  continuous  then  any 
local  best  [0,1]  approximation  from  _to  f 

is  continuous. 


Proof . Assume  to  the  contrary  that  f has  a discontinuous 


local  best  approximation  s(t,’)  with  a discontinuity 

at  t . Without  loss  of  generality  we  may  assume  that 

[s(t, *) ] (tm)  < 0.  This  implies  via  (2.1)  that 

e(t,t-)  > 0 and  e(t,t  +)  < 0.  But  then  0 < e(t,t  -)  - 
— m — -tu—  — « m 


e(t,t  +)  = [s(t,')]  (t  ) which  is  a contradiction  and 
* m = v m 

completes  the  proof. 


Setting  [a,b]  to  be  the  comvex  hull  of  a and  b 
we  may  now  state  a result  on  discontinuous  spline  approxima- 
tion. 


THEOREM  2.3.  Let  s ( t , • ) be  a local  best  approximant  to 

f from  S . If  s is  discontinuous  at  t then 
n — m 

[ s ( t , t m“ ) , s(t,tm+)]  C [f(t-),  f(t+)]. 

The  proof  follows  quickly  from  Theorem  2.1. 
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APPROXIMATION  BY  REGULAR  SPLINES  WITH  FREE  KNOTS 


H.  Werner 

In  this  paper  the  concept  of  regular  splines  S of  degree 

k with  m- I interior  knots  is  introduced  and  a survey’of  recent 

results  on  this  subject  is  given.  Then  new  results  for  Tscheby- 

scheff  approximation  by  these  splines  with  free  knots  are 

sketched,  the  closure  of  S , is  characterized  for  classes  of 
• m k. 

stiff  regular  splines  steep’at  Jhe  boundary ,^and  an  upper  bound 
for  the  number  of  zeros  of  s-s  , where  s,  s e S ^ is 
quoted  leading  to  the  length  of  a (sufficient)  al?£rnant. 

1 Introduction 


For  practical  purposes  it  is  useful  to  allow  for  approxi- 
mating functions  which  share  certain  properties  with  the  func- 
tion being  approximated  thus  motivating  the  definition  of  re- 
gular splines  which  sets  up  the  spline  from  portions  of 
(mildly)  nonlinear  families. 

Let  I = [a,B]  and  consider  subdivision  into  m non-zero 

intervals  I.  = !x.  , ,x.~]  , x = a,  x = B.  Hence  we  have  m-1 
J L J-l  Jj  o m 

interior  points,  called  knots  x.  < x„  < ...  < x Let  k 

1 1 m- 1 

specify  the  degree  of  smoothness,  \a  >_  2. 

Given  are  m classes  <?  . of  functions 

J 

k 

tj(x;  xj_|»  xj » C,  d)  £ C (Xj_,»  Xj) 
continuously  depending,  as  are  their  derivatives,  on  the  four 


parameters  xj_] 
open  intervals. 


Xj , c,  d,  where  c and  d vary  in  appropriate 
Sometimes  higher  differentiability  properties 


of  tj  are  needed.  (Note  that  the  intervals  Ij  may  vary.)  Then 

f i k 

S . :=  ulu  6 C (I),  3 I, ....  ,1  3 I.  c I such  that 
m,k  11  ’ 1 m j 

ulj_  = Pj  + tj  ’ Pj  6 ’P'k-l  » cj  e % j ’ J = 1 

describes  the  class  of  regular  splines  with  free  knots.  7^.. 
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denotes  the  polynomials  of  degree  k-1  or  less. 


The  class  ^ should  be  regular , as  first  introduced  by 

Schaback  £.13],  it  is  possible  to  parametrize  the  class  by 

(k) 

means  of  the  values  of  t at  two  different  points,  in 
particular  xj_j  anc*  xj  • other  words,  if  t*,  t e ? and 


Dk(t*-t) (z) 


for 


z - Vi 


and 


x., 


then  t (x)  = t(x)  in  rx.  ,,  x.l. 

L J-l  Jj 

Equivalently  the  differences  of  the  kth  order  derivatives  of 
two  functions  of  ^ can  have  at  most  one  separated  zero. 


Typical  examples  of  ^ are 

t°  = (d-x+c)-xk,  t'  = -£  , t^  = c.exp(dx),  = c log(x+d), 

Q X 

with  obvious  restrictions  on  c,  d depending  on  the  domain 
of  x.  If  c and  d are  arbitrary  reals,  t°  just  produces  the  poly- 
nomial splines,  but  with  other  restrictions  other  results  will 
arise . 

2 Survey  on  Interpolation 


The  above  properties  suffice  to  deal  with  the  interpolation 

problem,  data  prescribed  in  the  way  familiar  from  polynomial 

splines.  For  the  class  t we  obtain  the  so  called  special 

rational  splines  in  S . . For  k = 2 Schaback  H 2^  treated 

m,k  u J 

the  problem  extensively  and  established  uniqueness.  To  obtain 
existence  he  rewrote  the  interpolation  equations  as  variational 
equations  of  an  optimization  problem.  Recently  Baumeister  ]4j 
dualized  the  optimization  method  to  get  a new  definition  of 
certain  classes  of  regular  splines,  compare  also  Baumeister, 
Schumaker  [5]  , Schumaker  []l  5j  . For  sufficiently  fine  sub- 
divisions of  I the  interpolation  with  regular  splines  is 
closely  related  to  the  polynomial  spline  interpolation.  It  can 
be  treated  by  a perturbation  method  as  I showed  in  Werner  £l  8] 
for  k = 2;  the  general  case  was  solved  by  Arndt  fl]. 
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To  conclude  this  survey  it  should  be  added  that  regular 
splines  are  a powerful  tool  in  handling  initial  value  problems 
of  differential  equations,  compare  Runge  Pi  l]  , Werner  [18],  [19] 
and  boundary  value  problems,  compare  the  forthcoming  Brunei 
thesis  of  Ph.  Moore  ]lo] . In  this  case  the  freedom  of  the 
localisation  of  the  knots  is  particularly  valuable  although  with 
proper  choice  of  % one  may  do  quite  well  with  evenly  spaced 
knots . 

3 Tschebyschef f Approximation  with  free  knots 

To  develop  the  theory  of  T-Approximation  we  will  introduce 

two  additional  assumptions  upon  the  class  ? . The  function 

k . k 

t£  4 shall  be  stiff,  i.e.  D t > o,  respectively  D t < o.  This 
excludes  heavily  oscillating  functions  and  is  useful  in  con- 
sidering sequences  in  S , . 

m,k 

While  boundedness  of  the  function  t in  an  interval  guaran- 
ties boundedness  of  the  parameters  in  case  of  polynomials,  i.e. 
for  example  in  t°  , this  may  be  quite  different  in  other  cases. 
To  get  a realistic  picture  of  these  circumstances  we  analyze 
the  special  rational  splines  more  closely. 


Consider  the  one  parameter  family 


t(z,d)  :=  ck 

• = Ck 

d 

d-z 

- p(z 

,d). 

where  p(z,d) 

, k-1 

1 k- 

2 

k~ 

K 1 

}'Ck’  Ck  (k- 1 ) ! 

= (z 

+ z 

•d+.  . 

. ,+d 

in  z t [- 1 , o] 

, say,  and 

o < ' 

l/d  + 

1 < oo  # 

Observe  the  jump  of  the  (k-l)st  derivatives  at  z = o.  The 


1 
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analogous  situation  can  be  found  at  the  left  endpoint  of  the 
interval . 


We  say  the  class  t is 
is  a (k-l)st  degree  polynomial 

% 

t(x,d)  = t (x ; Xj_j , Xj , 1 , 

has  the  limits  given  in  (*)  if 
analog  is  true  with  z = x-Xj_|, 
should  enter  multiplicatively  f 


steep  at  the  boundary  if  there 
such  that 

d)  - p (x ; Xj_j , x. , d) 

z = x-x.  for  d -*■  o and  if  the 
J 

for  d ->•  00  , say.  The  parameter 
or  the  class. 


c 


To  investigate  existence  of  best  approximations  we  con- 
sider the  closure  of  ^ under  compact  convergence.  One  can 
even  show  the  compactness  of  bounded  sequences  of  S 

m j k 

If  J C (a, 6),  the  uniform  boundedness  of  any  sequence 

{s(v)}  in  I implies  uniform  boundedness  of  all  derivatives  up  to 

order  k-1  in  J due  to  the  stiffness  (which  implies  monotonicity 

of  'S(V)  and  boundedness  of  appropriate  difference  quotients. 

By  taking  subsequences  one  may  assume  that  a sequence, again 

-+  D 

(v) 


denoted  by  s^V\  has  been  found  such  that  D^s^V^  -*  D^s  uniformly 


in  J for  j = o,...,k-2;  furthermore,  that  lim  x 


J 


x.  exist, 
J 


i.e.  the  knots  converge,  as  do  the  parameters  d . -*■  d and 

v ^ 

c.  -*■  c.  for  v -+  °°  , where  the  limit  may  be  °°  . 

J J 

It  is  easily  seen  that  in  any  subinterval  of  J free  of  limit 

knots  Xj , the  limit  function  s is  either  equal  p^ , a polynomial 

of  degree  (k-1)  ("polynomial"),  or  Pj+tj  ("tegular").  So  it 

remains  to  investigate  the  behaviour  in  a neighbourhood  U 

of  an  interior  knot  x^ . If  s £ C1  '(U)\c1(U)  the  knot  is  called 

of  the  ith  kind.  A knot  is  n-f old , if  n limit  knots 

x.,...,x.  , coincide.  1-fold  knots  are  called  simple. 

J J+n-1  K 

1)  At  simple  knots  we  observe  the  same  phenomena  as  for  fixed 
knot  approximation,  compare  Werner  jj6,17].  If  at  x^  locally 
the  function  s is: 


i)  regular  at  both  sides  of  x^ , then  s e C (U) , 
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ii)  regular  at  one,  polynomial  at  the  other  side,  is 
of  kth  kind, 

iii)  polynomial  at  both  sides,  is  of  (k-l)st  kind. 

2)  At  two  fold  knots  we  may  think  of  one  subinterval  shrinking 

to  zero  length  and  in  this  interval  s becoming  polynomial, 

this  allows  an  extra  discontinuity  and  leads  to  the 

following  additional  new  cases.  If  at  x.  = x.  , the  function 

J J-l 

s is  locally: 

iv)  as  in  i)  above,  then  x^  is  of  kth  kind, 

v)  as  in  ii)  above,  then  x^  is  of  (k-l)st  kind. 


3)  Finally,  if  the  knot  x^  is  three-fold  (or  even  more)  then  we 
may  think  that  two  polynomial  intervals  of  zero  length  may 
appear.  This  makes  it  possible  that  s is  regular  at  both 
sides  of  Xj ,but  in  spite  of  this,Xj  is  of  (k-l)st  kind. 


Since  there  cannot  exist  (k-2)nd  kind  knots  due  to  the  conti- 
k-2 

nuity  of  D s,  no  other  cases  are  possible.  To  complete  the 
description  of  one  has  to  add  that  interior  polynomial 

sections  always  are  the  limit  of  at  least  two  regular  sections 
of  the  converging  sequence;  i.e.  it  seems  that  every  polynomial 
interval  absorbs  one  knot. 


If  we  define  the  sum  of  interior  knots  (counting  multipli 

city  as  1,2,  or  3)  plus  the  number  of  interior  polynomial 

sections  of  s as  its  order  ord(s),  one  has  to  work  out  that 

s e S ,\s  .of  order  k'  < k is  the  limit  of  functions  of 
m,k  m,k  — 

S , , under  compact  convergence, 
m , k 

These  statements  are  generalizations  of  the  results  for 

special  rational  splines  obtained  by  Schomberg  [j  4^  • It  is  now 

a straightforward  conclusion  to  establish  existence  of  best 
* . - . 

approximations  s in  S , but  a quite  different  matter  to 
find  out  when  s will  lie  in  S 

m,k 
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To  establish  sufficiency  criteria  for  best  approximations 

(as  length  of  alternants)  in  the  standard  way,  one  may  count 

zeros  of  differences  r = s-s,  where  s,s  e S , . The  consi- 

m,k 

derations  of  Schomberg  [j4]  immediately  carry  over  to  show  that 
this  number  (counting  multiplicities)  has  the  upper  bound 

ord(s)  + ord(s)  + k+1  - (common  knots  of  s and  s'). 

The  details  of  these  results  will  be  contained  in  a forthcoming 
report . 
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AN  APPROXIMATION  PROOF  IN  THE  THEORY  OF  COMPLEX  L -PREDUALS 

Daniel  Wulbert 


Abstract.  A short  proof  is  given  for  the  Hirsberg-Lazar  char- 
acterization of  complex  L^-predual  subspaces  of  C(X,  G) . The 

proof  depends  on  the  approximation  of  L^-preduals  by  certain 

finite  dimensional  subspaces,  and  uses  a Korovkin  type  approx- 
imation argument  about  the  convergence  of  contractive  projec- 
tions . 

I Introduction 

The  main  results  related  to  this  lecture  are  about  the 
structure  of  complex  L^-predual  Banach  spaces.  However  a 
preliminary  part  of  the  development  is  a short  proof  of  the 
Hirsberg-Lazar  characterization  of  such  subspaces  of  C(X,  G) . 

The  proof  depends  on  the  approximation  of  L^-preduals  by  j 

certain  finite  dimensional  subspaces.  The  Hirsberg-Lazar  proof 
uses  the  simplex  structure  that  E.G.  Effros  showed  was  asso- 
ciated with  a complex  L^-predual.  This  talk  will  present  the 
approximation  theoretic  approach. 

The  result  by  Hirsberg  and  Lazar  is  the  following: 

THEOREM  1.1  _If  E is  a subspace  of  C(X,  G)  whi ch  contains 
the  constants,  the  following  are  equivalent : 

(a)  E is  an  L^-predual 

(b)  E is  selfadjoint  and  ReE  is  a real  L^-predual. 

II  Preliminaries  and  Notation 
The  following  two  theorems  characterize  L^-preduals  by 
their  property  of  being  approximatable  by  subspaces  of  the  type 

Preceding  page  Hank 
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Both  results  apply  to  real  and  complex  Banach  spaces. 

THEOREM  2.1  .If.  E is  a separable  L^-predual  Banach  space 
contained  in  a Banach  space  F then  there  is  a sequence  of 

oo 

norm  one  projections  {P^}^_^  defined  on  F such  that 

1.  P.f  e E for  all  f e F and 

x — 

2.  P^g  -+■  g in  norm  for  all  g e E. 

THEOREM  2.2  E Is  an  L^-predual  if  and  only  if  for  each  e > 0 
and  each  finite  set  S c_  E,  there  is  a finite  dimensional  sub- 
space^  F c.  E such  that 

1.  F is  isometric  with  £n  for  some  n and 

00 

2.  sup{inf{||s  — f |t  : f e F}  : s e S)  < e* 

Both  results  are  known;  and,  for  example,  either  can  be 
found  in  [Lazar  and  Lindenstrauss , 1966]  or  as  variants  of 
results  there. 

We  will  use  the  rest  of  the  section  to  record  our  nota- 
tion. We  will  use  |R  and  (E  to  denote  the  real  and  complex 
fields.  For  a compact  Hausdorff  space  X,  C(X,  (E)  represents 
the  continuous  complex  valued  functions  on  X,  equipped  with 
supremum  norm,  and  ($)  is  equivalent  to  C(Y,  (E)  for  a set 

Y containing  n points.  For  a measure  space  (X,Z,p,)* 
L^(X,Z,(i,G)  consists  of  the  integrable  complex  valued  functions 
on  (X,E,y).  Analogous  definitions  hold  for  the  real 

Banach  spaces  obtained  when  replacing  (E  by  (R  above. 

For  a Banach  space  E,  E*  and  S(E)  represent  the  dual 
of  E and  the  unit  ball  of  E respectively.  The  extreme 
points  of  a convex  set  K are  the  set  ext(K).  A Banach  space 
E is  an  L^-predual  if  there  is  a measure  space  (X,Z,p.)  such 
that  E*  is  isometric  with  L^(X,Z,p.). 
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III  Proof  of  Theorem  1.1 

PROPOSITION  3-1.  Let  E be  a subspace  of  C(X,  (t)  that  r.nn- 

tains  the  constants.  If  E is  an  L, -predual  then  E is  self- 

1 - 

adjoint. 

Proof . Although  this  is  not  obvious , it  is  known,  that  we  may 
assume  that  E is  separable  and  that  E separates  the  points 
of  X. 

Let 

(3.1)  D = closure[span  ReE]  c C(X,  d)  . 

Then 

i 

(3.2)  dE  = dReE  = dReD  = dD, 

where  d denotes  the  Choquet  boundary  of  a space. 

Let  P^  be  the  projections  of  theorem  2.1.  Then 

(3.3)  (P.d)(x)  -+  d(x)  for  x e dE  and  d e D. 

} 

From  Choquet  theory  and  the  Lebesgue  dominated  convergence  , 

theorem,  this  implies  that  P^  -*■  d weakly  (since  dE  = dD). 

Since  P^d  e E we  conclude  that  D £ E and  E is  selfadjoint. 

PROPOSITION  3-2  Let  E be  a selfadjoint  subspace  of  C(X,  (I)  . 

Then  E is  a complex  L^-predual  if  and  only  if  ReE  is  a 
real  L^-predual. 

Proof  . The  proof  uses  theorem  2.2.  First  assume  E is  a 
complex  L^-predual.  Let  G be  a complex  linear  subspace  of 
C(X,  (I)  that  is  isometric  to  £^(<I),  and  suppose  f is  a 
real  function  such  that  d(f,  G)  < e.  We  will  show  there  is  a 
real  linear  subspace  S of  ReG  c E that  is  isometric  with 
£n(IR)  and  for  which  d(f,  S)  < 2e.  By  theorem  2.2,  this  will 
prove  the  result.  Since  G is  isometric  with  there  exists 
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a basis  {g^}  of  G 


(a) 


ei(V 


and  points  {x^}^_^  c.  x 

if  i = j 
if  i t j 


such  that 


and 

(b)  for  each  y e X there  are  scalars  X.  such  that 

both 

(i)  Z J | < 1 and 

(ii)  Za^g^y)  = ZX^a^  for  any  set  f scalars  a^. 

Now  let 

(3.4)  S = real  span{Reg^}^_^  c E- 
If  follows  that  the  map 


(3.5)  f -*  (f(x1),  f(x2),  ...,  f(xn)) 

is  an  isometry  of  S onto  £^(IR)-  For  if  x e X and  e IR, 

(3.6)  |Za_.Reg^  (x)  | 5 |Za^.g_.(x)|  5 |ZXia_.|  < max | a^  | • 

Now  to  verify  the  estimate  d(f,  S)  < 2e,  let  a_.eC 
be  such  that  ||f  - Za_jg  ||  < e. 

One  verifies  that 

(3.7)  J Ima_.  | 5 e,  and  ||Za_.g_.  - Z(Rea^)g^||  < e* 

Thus  if 


(3.8)  q = ReZ(Rea^)g^ , 
then 
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(3.9)  ||ZRe(a_.g^ ) - q||  < e . 

But  since  jlZRea^g^  - f||  < e,  ||q  - f||  < 2e. 

This  proves  the  case  in  which  E is  a complex  L^-predual.  In 
fact  this  shows  that  ReE  is  a real  L^-predual  even  when  E 
is  not  selfadjoint. 

If  ReE  is  a real  L^-predual  one  shows  that  the  complex 
span,  of  an  isomotric  copy  of  £^(R)  ReE  is  isometric  with 
£^(C).  The  proof  is  almost  exactly  as  above  but  shorter  since 
it  dosen't  require  computing  the  estimate  above.  This  part  is 
deleted . 
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APPROXIMATION  IN  A NORMED  SPACE 


New  foundations  and  methods 
Marc  Zamansky 

In  this  paper  we  outline  a novel  approach  to  obtaining 
convergence,  approximation,  and  saturation  results  for  certain 
classes  of  linear  approximation  processes.  A more  detailed 
(preliminary)  exposition  of  these  ideas  has  appeared  as  a 
preprint:  Zamansky,  Approximation,  XI  +84  p.,  1975,  Institut 

Henri  Poincare,  Paris.  Here  we  have  changed  the  order  and 
stated  some  results  (cf.  (4.1), (4.6),  (4.7))  in  more  general 
form. 

The  origin  of  this  theory  is  a very  simple  proof  of  the 
theorem  about  the  saturation  class  of  the  first  arithmetic 
mean  (1950)  . It  seems  that  only  Favard  in  1957  and  Aljancic 
in  1958  have  used  this  idea,  but  in  special  cases.  Here  is 
the  idea:  if  Sn  is  a sequence  of  numbers,  Tn  = (SQ  + ... 

+ Sn_^)/n  and  0 < a < 1,  then 

S - s = 0(l)/na<=>T  - T = 0(l)/na+1. 

n n+1  n 

We  begin  in  section  1 by  defining  the  classes  of  approxi- 
mations: they  are  all  sequences  p = (pn) , p > 0,  p 1 that 

satisfy  some  conditions.  The  classes  of  (linear)  processes 
of  approximation  are  defined  by  natural  conditions  in  ref- 
erence to  processes  [1  - tP]  (typical  means)  which  are 
"s  tandar d-processes . " 

We  prove  that,  under  general  conditions,  any  "usual" 
process  is  equivalent  to  a standard — process  for  a well  de- 
fined approximation  class. 

For  example,  we  apply  this  theorem  and  the  intermediate 
theorems  to  periodic  functions  and  obtain  in  a few  pages  all 
fundamental  results,  known  or  new.  It  is  superfluous  to  dis- 
tinguish convergence,  approximation,  or  saturation.  We  even 
obtain  the  best  asymptotic  constants  in  general  cases.  We 
note  that  Fourier  transforms  do  not  play  any  role. 

1 Classes  of  approximations 

An  approximation  is  a real,  positive  function  pi  of 
0 >_  0O  and  a class  of  approximations  is  the  set  of  p which 
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satisfy  some  conditions.  We  define  the  class  {A} 


(1.1)  P,„  = 0(l)p  , 0 < X < 1<=^>  Z p = 0(1) 

Xn  n Xn<k<n  k 


by 

n pn 


(where  0(1)  is  dependent  on  X,  but  is  in  n) . {A}  contains 

all  usual  approximations  but  not  exponential  approximations. 


(1.2)  Let  A -a  +...+a  ^ 0 and  a (a,p)  = 

no  n n 

n 

(§laklpk^lAnl’  Tn(a,p)  " I^ak^lAk-l^pk  which  define 

the  functions  o(a):p  ■+  a(a,p),  x(a):p  ->  x(a,  ).  We  write 
a (a)  and  x when  a^  = (k  + l)a,  ct  > -1. 

The  classes  (a(a)},  (x)  are  defined  by  a(a,p)  = 

0(l)p,  x(p)  = 0(l)p  (it  follows  that  o(a,p)  A,  p and,  if 
p e (A},  x(p)  a,  p) . 

(1.3)  We  establish  the  technical  properties  that  will  play 

a role.  For  example:  {a(a)}c  {A},  (an(a,p))  e {A}, 

o(3)  o o(a)  = 0(1)  a(inf  (a, 3)),  a( a)  o x and  to  a(a)  are 
0(1)  (a( a)  + x) , etc... 

(1.4)  Because  a( a,p)  = 0(l)p  =>  a(oc,  o(p))  = o(l)p  and,  if 

t (p)  = 0(1),  x (o(p))  = o(l)x  (p),  all  results  will  be  true 
n n n 

when  we  take  o(p)  instead  of  p. 

(1.5)  If  e > 0,  e ->-0,  there  exists  p 4-  0,  p > e 

n n n n — n 

and  p e{c(a)}  for  all  a>-l. 

A consequence  is  that  all  theorems  of  approximation  will 
be  true  if  we  consider  only  the  convergence. 

2 Classes  of  processes 

Any  (linear)  process  for  a series  Zu^  in  a normed  space 

E gives  Zc  , u.  ; therefore  we  define  a process  by  a sequence 
k n,  k k. 

(<£n)  of  functions  of  t = k/n  (n  = 1,  2,  ...,  k = 0,  1,  2,...) 

such  that  <fc  (t)  =*  c , . 

n n,  k 
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In  the  interest  of  clarity,  we  distinguish  the  processes 

[ 4>  ] or  [ 4>  ( t ) ] , where  <j>  (t)  =0  if  t < 0,  t > 1,  and 
n n n 

the  processes  [ 4>  [ or  [4>  (t)  [,  and  where  4>  (t)  =0  if 
n n n 

t < 0.  We  write  Tn^n’ujc)  = £<t>n(k/n)Uk* 

Let  = 4>n(t)  - d>n ( t + 1/n)  , A2  ( 4>n>  = ^ (Ax  C4>n>  ) , 

(2.1)  The  class  of  processes  { [ p , a , e]}  is  the  set  of 

n 

[<J>  ] such  that  the  <J>  are  uniformly  bounded  and  for  a 
n n 

finite  number  of  with  0 < n'  < b < 5^  < — 1» 

(1)  in  [n',  1],  nA^($  ) = 0(1)  uniformly  with  respect  to  t 
and  n; 

(2)  in  [n\  qL...,  ] C±»  5i+1t>  n2A2(<hn)  = 0(1)  likewise 

(3)  in  [0,  n]»  4>  (t)  = d>  (0)  - tP(a  + iji  (t))  where  p > 0, 

n n n n 

a = constant  and  for  a e > 0 and  0<  t<  (t)  = 0(1), 

^ i 2 2 c ^ 

n t Ai^n^  = 0(1) » n t A2^n^  = 0(1)  uniformly  with 

respect  to  t and  n. 


(2.2)  The  class  {[p,  a , e,  6 [}  is  the  class  of  the  pro- 


cesses [ [ which  satisfy  conditions  (2.1)  and 


b'  < t and  a 6 > 0,  t<j>  (t)  = 0(1),  n t 


6+1 


W 


(4)  for 

0(1),  n2t^+2A2(<^n)  = 0(1)  uniformly. 

(2.3)  The  standard-processes  are  [1  - tP]  and  we  will  answer 

the  question:  if  a standard-process  gives  an  approximation 

p,  what  is  the  approximation  for  [<j>n]  or  [ 4>n  [ , and  in- 
versely? 

N.B.  — If  we  compare  [ d>n ] with  [1],  the  hypotheses  which  con- 
cern A 0 ( <t>  ),  are  useless, 
z n 

(2.4)  With  some  precautions,  but  without  new  hypotheses,  the 

results  are  true  for  the  processes  d>  (k/n 

n 

erally  for  4>x  (k/x)  . 


1/P 


) or  more  gen- 
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3 General  Theorems  in  a normed  space 

We  introduce  the  notation  ([*  ];  p ) :=  T ([d>  ],  u,) 

n n n n k 

- $ (0) s - <p  (1)  (T  ( [ 1 ] , u,  ) - s)  = 0(l)p  , s e E and 
n n n k n 

( [ d>n  f » Pn)  :=  Tn(I«n[.  \)  - 4>n(0)s  = 0(l)pn. 

(3.1)  THEOREM  A.  We  have 

(3.1.1)  p e (A)  , ([1];  pr)  =>(UnJ;  (|an|  + l)on(p  - 1,  p)) 

(3.1.2)  p e (A)  , ([1  - tP ] ; p ) =>  ([*  ];  ( |a  | + i)0  (p  + 

n n n n 

e - 1,  a(p,  p))) . 

(3.1.3)  If  E <p  (k/n)cL  converges  and  p e {A},  one  has: 

n k. 

6 > 0,  ( [1  - tP] ; Pn)  =>  ( td>n  [;  ( |aj  + DcMp  + £ - l,o(p,  p) ) 

+ a (p,  p)) 
n 

6 = 0,  ( [1  - tp ] ; Pn)  =>  ( [ 4>n  [;  ( |an|  + l)on(p  + e - l,a(p,  p)) 

+ t (a(p,  p))  . 

n i / 

(3.1.4)  For  <^(k/n  p)  there  are  analogous  results. 

(3.2)  THEOREM  B.  (Principal  Part)  Let  p > 0,  q + 1 > 0, 
q + 1 # p . Then 

(1)  If  E is  normed,  ([1  - tP];  p )=>([tP  - tq+1]; 

— n 

CTn (q , p))  when  q < p,  or  q > p and  o(p)  = 0(l)a(q), 

or  q = p and  o(p)  e (o(p)}. 

(2)  If  E is  a Banach  space,  x (p)  = 0(1)  and 

( [ tp  - tq  ] ; pn)=>3s  e E and  ([1  - tP] ; xn(p))  when 

P / q and  p e {o(inf(p,  q))}  or  p e {a(p)  o a(p)}  . 

(3.3)  Examples . 

(3.3.1)  Theorem  A contains  many  know  results  about  conver- 
gence, but  here  we  have  stated  only  results  about  approxi- 

2 

mations.  If  we  consider,  for  example,  [1  - t ] o [1  - t], 
we  obtain  the  Cesaro  processes.  The  Riemann  process 
4> ( t)  = sinP  t/tP  is  a very  good  example  for  3.1.3  (B  = 0 
if  p = 2,  B > 0 if  p > 2).  For  3.1.4  we  mention  Gauss- 
Weierstrass,  de  la  Vallee-Poussin,  generalized  Abel  processes 
and  many  others. 
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(3.3.2)  The  processes  [<J>]  ('cf.  the  work  of  Sz.  Nagy  in  1948 
but  only  for  the  periodic  functions  in  the  space  C)  are  a 
special  case. 

(3.3.3)  It  is  possible  to  write  the  results  with  p = constant 
and  s = 0.  We  obtain  an  upper  bound  for  ||T  ($  , u^ll  (c^* 
theorem  N in  4.1). 

4 Approximation  in  a normed  space  possessing 


a biorthogonal  system 


We  suppose  that  the  sequence  e^  e E and  sequence  (c^) 

of  linear  forms  compose  a biorthogonal  system.  For  f e E, 

we  consider  Eu^  where  u^  = c^COe^.  We  wr^te  T^(4>^,  ^ 

instead  of  T (<f>  , u,  ) and  P = ?a.  e , = ?kma,  , m > 0. 

n n k okk  okk 

(4.1)  THEOREM  N.  If  the  operators  ^([1  - t])  (the  first 

arithmetic  means)  are  continuous  and  1 1 T ([1  - t ] ) 1 1 = 0(1), 
— - n 

then  for  [ 4>  ] e { t p , a , e]},  4>  (1)  = 0,  a_  = 0(1),  one  has 
n n n n 

T ([*  ])  = 0(1). 
n n 

(4.2)  THEOREM  C.  If  P - f = 0(p  ) and  ||P^||  = 0(l)nI"||P  ||, 

— n n " n " n" 

then 


v=a^-l 

(D  P[“jL  - P[™]+1  = 0(1)  Ek  vm-1  pv,  a integer  >_  2. 
a a o v=a  o 

{2)  ,W  . ptm]  , 0(1)  ((  + d-  + Y (V  + I)"'1  pu) 

o o v=n 

o 


(3) 


p[m^  = 0(l)nma  (m  - 1.  p)  (first  proof  in  1949) 
n n 


(4.3)  P - f = 0(p  )=>T  ([1  - tP],  f)  - f = 0(1  )o  (p  - 1,  P). 

n n n n 

(4.4)  If  $ (0)  = 1,  * (1)  = 0,  UJ  e { tP,  an,  e]},  then 

T (U  ],  f)  - f = 0(p  ) =>  a (T  ([1  - tP],  f)  - f)  = 
n T n n n n 

0(l)a  (p  - 1 + e, p) . 
n 
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(4.5)  We  suppose  that  E is  a Banach  space,  [ 4> ^ ] e 

{ [p , an,  e,  0 ] > , d>n(0)  =1.  0 < inf  |an|  £ sup  |aj  < + ®. 

Moreover,  we  suppose  for  a sufficiently  small  q > 0 and 

0 < p < t < 1,  one  has  inf  |l  - <j>  (t)  I > 0.  Finally,  we 

n,  t n 

suppose  that,  if  e = 0,  for  0 < t p one  has 

inf(|l  - d>  (t)|/tP)  > 0.  Then,  if  p e (cr(p)}  and 
n,t  n 1 

Tn(p)  = o(i) » 

Tn(  [ 4>n  f*  f)  ~ f = 0(P„)  =>Tn([l  ~ tP],  f)  - f = 0(l)Tn(p) 

(4.6)  Equivalence  between  [<f>n]  or  [ 4>n [ and  [1  - tP].  Let 

E be  a normed  linear  space  possess  a biorthogonal  system.  We 
suppose  that  the  operators  T ([1  - t])  are  continuous  and 
||Tn([l  - t ] ) 1 1 = 0(1).  We  suppose  0 < inf  | a^  | sup  | a^  j < 

+ oo  and  we  consider  the  processes  which  satisfy  (4.4)  or 
(4.5).  Then  [<j>  ] ^ [1  - tP]  for  all  p e (a(inf(p  + e - 
1,  p))};  and  if  E is  a Banach  space  [ n [ [1  - tP  ] for 

all  p e (o(inf(p  + e - 1,  P))>  H {t} 

N.B.  One  may  extend  these  results  to  other  processes,  for 
example  $ (k/n^p) 

(4.7)  Equivalence  between  [<|>  ] and  [1].  Lf 

0 < inf  U (1)|  < sup  Id)  (1)1  < + » and  p e (a(p  - 1)},  then 
1 n — n 1 

^ [1]  for  the  approximations  (|anl  l)pn- 

(4.8)  Saturation.  Saturation  takes  place  if  the  c^  are 
continuous  and  we  obtain  the  saturation  classes. 


5 Example:  approximation  of  periodic  functions 

We  give  . only  one  example.  It  is  unnecessary  to  specify 
which  usual  topology  is  used.  With  the  previous  results,  we 
find  immediately  the  differential  properties  from  p^  to  u)^, 
(m-th  modulus  of  continuity,  m integer  ^ 1)  . If  we  want  to 
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prove  the  inverse  properties,  it  is  sufficient  to  prove  only 
the  following  theorems. 

(5.1)  THEOREM  J.  For  any  integer  m 1,  there  is  a sequence 

of  polynomials  of  degree  n,  such  that  Pn  - f = 

O(l)o)m(l/n)  . (We  give  a new  proof). 

(5.2)  Approximation  by  [1  - tp]. 

If  0 < p < m,  Tn([l  - tP],  f)  - f = O(l)on(p  - 1,  u>m) 

If  p > m,  Tn([l  - tP],  f)  - f * 0(l)o)m(l/n) 

If  p - 2m,  Tn([l  - t2"],  f)  - f - 0(l)w2m(l/n) 

(5.3)  If  the  conjugate  f e E and  Tfl(p)  - 0(1),  then 
(cf.  A. 3) 

pn  * f " °<Pn)r>Tn([1  " tP]»  ? * 0(l)(on(P  - 1,  p) 

+ xn(p)). 

We  obtain  quickly  all  relations  between  derivatives  and 

approximations,  between  the  oj  of  f and  f,  between  the 

m 

um  and  the  best  approximations,  and  the  saturation  classes 
of  processes  [<f>n]  or  [$  [. 

N.B. — For  the  periodic  functions,  it  is  possible  to  give  best 
hypotheses  for  Theorem  4.6:  for  [ (resp.  [ d>n  [)  in  Lr, 

1 < r < + »,  4>n(l)  - 0(1)  (resp.  <|>n(t)  - 0(1))  and  in  the 
other  spaces,  <f>n(l)  ■ 0(l/logn)  (resp.  ij>(t)  - (1/logt), 

t -*•  + <») . 

(5.4)  ^f  e > 0,  the  results  give  the  approximations: 

for  U ]»  "from  convergence  to  saturation  inclusively": 
for  U [,  "from  approximations  such  that  t (p)  * 0(1)  to 
saturation  inclusively" 
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(5.5)  Best  asymptotic  constants.  (Example). 

THEOREM.  Let  [<J>n]  e { [p , an»  e]},  (resp.  [ [ e ([p,  an> 

e,  Bl)  where  a 0,  a = 0(1),  e > 0,  B > 0.  Let  2m 
n n 

be  an  Integer  > 2 (resp.  2m  + 1>  1). 

If  £ E an(}  p = 2m,  then 


W f)  - f + 

If  f(2m+1)  e e 

T (♦  , f)  - f + 
n n 


(ill 


m 


2m  n 


r2m 

a f * o 


fe) 


and  p = 2m  + 1,  then 

("1)m+1  a fr2m+1>  , 
2nHd  3n  f ° 


CM 


We  can  give  more  precise  equalities  when  we  have  better 
information  about  the  processes  or  when  we  use  the  u^.  Then 
we  find  in  general  cases  the  best  asymptotic  constants. 
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